(57) Abstract

The present invention relates to methods for identifying nucleic 
acid sequences which encode two or more specific interacting peptides 
or proteins. Furthermore, the present invention relates to kits which 
may be used for the identification of nucleic acid sequences in 
accordance with the method of the present invention.

NOVEL METHOD FOR THE IDENTIFICATION OF NUCLEIC ACID SEQUENCES
ENCODING TWO OR MORE INTERACTING (POLY)PEPTIDES 

The present invention relates to methods for identifying nucleic acid sequences which 
encode two or more specific interacting peptides or proteins. Furthermore, the present 
invention relates to kits which may be used for the identification of nucleic acid
sequences in accordance with the method of the present invention. 

Protein-protein interactions play an important role in all biological processes, from the 
replication and expression of genes to the morphogenesis of organisms (Lewin, B. 
1994, Genes V. Oxford University Press). Methods for detecting protein-protein 
interactions have proved useful in understanding the basic mechanisms of different 
biological processes and the development of therapeutics. Detection of protein-protein 
interactions can be divided into two main categories: (i) physico-chemical based and (ii) 
genetic approaches (Phizicky, E.M. & Fields, S., Microbiological Reviews 59 (1995) 94-
123). Detection of protein-protein interactions by physico-chemical methods usually 
requires significant amounts of material, and more importantly, the identity of the 
proteins to be studied must be known. Recent developments in methods of mass 
spectrometry circumvent this problem but suffer from the disadvantage of requiring
sophisticated equipment and expertise (Wang, R. & Chait, B.T., Current Opinion in
Biotech. 5 (1994) 77-84). In contrast, genetic approaches provide an easy and powerful
method of identifying protein-protein interactions without the need for pure material and 
specialized equipment, with the added advantage of higher throughput. 


Different genetic approaches have been used to identify protein-protein interactions. 
The current method of choice is the yeast 2-hybrid system (Fields, S. & Song, O.K.,

In spite of the power of all the aforementioned genetic selection approaches, they are
limited to the selection of interacting binding entities from only a single genetically
diverse population (library vs. individual).

It would, however, be highly desirable to simultaneously identify binding entities and 
their specific binding partners in a library vs. library setting, wherein preferably at least 
two genetically diverse populations are involved. A solution to this technical problem, i.e. 
the identification of interacting entities and the respective nucleic acid sequences from 
more than one genetically diverse population (library vs. library) is neither provided nor 
suggested by the prior art. The present invention solves the above technical problem by 
providing the embodiments characterized in the claims. By using these embodiments, it 
has become possible to increase exponentially the rate at which (poly)peptide- 
(poly)peptide interactions are detected. The present invention may find applications in 
the field of functional genomics, whereby different proteins of unknown function can be
related to other proteins.

Accordingly, the present invention relates to a method for identifying a plurality of 
nucleic acid sequences, said nucleic acid sequences each encoding a (poly)peptide 
capable of interacting with at least one further (poly)peptide encoded by a different 
member of said plurality of nucleic acid sequences, comprising the steps of: 

(a) providing a first library of recombinant vector molecules containing 
genetically diverse nucleic acid sequences comprising a variety of nucleic 
acid sequences encoding (poly)peptides; 

(b) providing a second library of recombinant vector molecules containing 
genetically diverse nucleic acid sequences comprising a variety of nucleic 
acid sequences encoding (poly)peptides capable of interacting with further 
(poly)peptides as mentioned in step (a), wherein the vector molecules
employed for the production of said recombinant vector molecules and/or
the recombinant inserts display properties that are phenotypically 
distinguishable from those of the vector molecules and/or the recombinant 
inserts used in step (a) and wherein at least one of said properties 
displayed by each of said vector molecules and/or the recombinant inserts 
used in steps (a) and (b), upon the interaction of a (poly)peptide from said 
first library with a (poly)peptide from said second library together generate 
a screenable or selectable property; 

(c) optionally, providing additional libraries of recombinant vector molecules 
containing genetically diverse nucleic acid sequences comprising a variety 
of nucleic acid sequences encoding (poly)peptides capable of interacting 
with or causing interaction of (a) further (poly)peptide(s) as mentioned in 
step (a) and/or step (b), wherein the vector molecules employed for the
production of said recombinant vector molecules and/or the recombinant 
inserts display properties that are phenotypically distinguishable from 
those of the vector molecules and/or the recombinant inserts used in steps 
(a) and (b) and, optionally, at least one of said properties displayed by said 
vector molecule and/or the recombinant inserts used in step (c) together 
with at least one of said properties displayed by either said vector 
molecule and/or said recombinant insert used in steps (a) and/or (b), upon 
the interaction of a (poly)peptide from said additional library with either a 
(poly)peptide from said first library and/or a (poly)peptide from said second 
library generate a screenable or selectable property; 

(d) expressing members of said libraries of recombinant vectors or nucleic 
acid sequences mentioned in steps (a), (b) and optionally (c), in 
appropriate host cells so that at least one interaction is established; 

(e) selecting for the generation of said screenable or selectable property 
representing the interaction of said (poly)peptides;

(f) optionally, carrying out further selection, screening and/or purification
steps; and 

(g) identifying said nucleic acid sequences encoding said (poly)peptides. 
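
By way of illustration only, the logic of steps (a) to (g) may be sketched in silico as follows. The clone names, the marker names and the table of interacting pairs are hypothetical and were chosen solely for this example; the sketch merely shows that the selectable property arises only for co-expressed members whose (poly)peptides interact, and it is not a description of any particular laboratory procedure.

```python
from itertools import product

# Hypothetical libraries: clone name -> vector-encoded marker (assumed names).
LIBRARY_1 = {"A1": "ampR", "A2": "ampR", "A3": "ampR"}
LIBRARY_2 = {"B1": "camR", "B2": "camR"}

# Hypothetical table of which encoded (poly)peptides interact.
INTERACTS = {("A1", "B2"), ("A3", "B1")}


def select_interacting_pairs(lib1, lib2, interacts):
    """Return the (clone1, clone2) pairs that show the selectable property.

    Every combination is treated as co-expressed in one host cell (step d).
    A pair is kept only if its (poly)peptides interact and the two vectors
    carry phenotypically distinguishable markers (steps e and f).
    """
    selected = []
    for clone1, clone2 in product(sorted(lib1), sorted(lib2)):
        distinguishable = lib1[clone1] != lib2[clone2]
        if distinguishable and (clone1, clone2) in interacts:
            selected.append((clone1, clone2))
    return selected


# Step (g): the surviving pairs identify the nucleic acid sequences.
pairs = select_interacting_pairs(LIBRARY_1, LIBRARY_2, INTERACTS)
print(pairs)
```

In this toy model the number of combinations tested in a single experiment is the product of the two library sizes, which is the practical advantage of a library vs. library setting over a library vs. individual setting.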

Thus, in the context of the present invention, the term "properties that are phenotypically 
distinguishable" relates alternatively to properties that are encoded by the vector 
molecule or to properties that are encoded by the recombinant insert or to both types of 
properties. As regards the vector-encoded properties, these may e.g. be resistance 
markers or requirements for special nutrients. It should be noted that the recombinant 
insert may comprise a nucleic acid portion encoding said property in addition to the 
nucleic acid portion responsible for the interaction. 

In the context of the present invention, the term "different member" denotes a different 
entity which may be, but is not necessarily, structurally different. 

Further, in the context of the present invention, the term "plurality" bears the meaning of 
"at least two". 

The novel properties generated by the at least two recombinant inserts reflect the 
inventive principle of the present invention. That is, only if two (or more) (poly)peptides 
interact, for example, in a homo-dimeric or hetero-dimeric fashion, a screenable or 
selectable property is generated. The interaction between the two or more molecules 
may be a direct one or may be mediated indirectly. Examples of a direct interaction are
the binding of an antibody encoded by a nucleic acid sequence from library 1 to a cDNA 
protein from library 2, the binding of a protein encoded by a nucleic acid sequence from 
cDNA library 1 to a protein from a cDNA library 2, as well as of an anti-idiotypic antibody 
encoded by a nucleic acid sequence from one of the libraries to a corresponding 
antibody encoded by a nucleic acid sequence from the other library. The nucleic acid 
sequences are preferably DNA and most preferably genes or parts thereof.

An example of an indirect interaction is the bridging of two (poly)peptides encoded by
the two libraries which is mediated by a phosphorylating enzyme. Once the
phosphorylation of one (poly)peptide encoded e.g. by library 1 is effected by the
respective kinase, then this protein is capable of interacting with the second 
(poly)peptide encoded by library 2. The phosphorylating enzyme exemplifying this type 
of interaction may be encoded by a nucleic acid from (one of) the additional libraries 
and/or may be encoded by the genome of the host cell. Typically, the interaction of the 
two (poly)peptides forms a "bridge" of molecules, said "bridge" being detectable using 
an appropriate detection process. Conveniently, said bridge is detectable by a tag 
molecule that is associated with, encoded by or attached to one of the (poly)peptides 
encoded by library 1 or preferably 2. 

Furthermore, the present invention relates to a method for identifying a plurality of 
nucleic acid sequences, said nucleic acid sequences each encoding a (poly)peptide
capable of interacting with at least one further (poly)peptide encoded by a different 
member of said plurality of nucleic acid sequences, comprising the steps of: 

(a) expressing in appropriate host cells 

(aa) nucleic acid sequences contained in a first library of recombinant 
vector molecules containing genetically diverse nucleic acid 
sequences comprising a variety of nucleic acid sequences encoding 
(poly)peptides; 


(ab) nucleic acid sequences contained in a second library of recombinant 
vector molecules containing genetically diverse nucleic acid 
sequences comprising a variety of nucleic acid sequences encoding 
(poly)peptides capable of interacting with further (poly)peptides as
mentioned in step (aa), wherein the vector molecules employed for
the production of said recombinant vector molecules and/or the
recombinant inserts display properties that are phenotypically
distinguishable from those of the vector molecules and/or the 
recombinant inserts used in step (aa) and wherein at least one of 
said properties displayed by each of said vector molecules and/or the 
recombinant inserts used in steps (aa) and (ab), upon the interaction 
of a (poly)peptide from said first library with a (poly)peptide from said 
second library together generate a screenable or selectable property; 

(ac) optionally, nucleic acid sequences contained in additional libraries of
recombinant vector molecules containing genetically diverse nucleic 
acid sequences comprising a variety of nucleic acid sequences 
encoding (poly)peptides capable of interacting with or causing 
interaction of (a) further (poly)peptide(s) as mentioned in step (aa) 
and/or step (ab), wherein the vector molecules employed for the 
production of said recombinant vector molecules and/or the 
recombinant inserts display properties that are phenotypically 
distinguishable from those of the vector molecules and/or the 
recombinant inserts used in steps (aa) and (ab) and, optionally, at 
least one of said properties displayed by said vector molecule and/or 
the recombinant inserts used in step (ac) together with at least one of 
said properties displayed by either said vector molecule and/or said 
recombinant inserts used in steps (aa) and/or (ab), upon the 
interaction of a (poly)peptide from said additional library with either a 
(poly)peptide from said first library and/or a (poly)peptide from said 
second library generate a screenable or selectable property; 

so that at least one interaction is established;

(b) selecting for the generation of said screenable or selectable property
representing the interaction of said (poly)peptides; 

(c) optionally, carrying out further screening, selection and/or purification 
steps; and 

(d) identifying said nucleic acid sequences encoding said (poly)peptides. 


In a preferred embodiment of the method of the present invention, said screenable or 
selectable property is expressed extracellularly.

This embodiment can conveniently be employed in laboratories using rather
conventional methodology for the extracellular detection of such properties, e.g.
column chromatography wherein the screenable tag is retained, in combination with
e.g. plaque purification techniques, which allow the further purification of the cells
that were originally enriched by the column chromatography step.

In a further preferred embodiment of the method of the present invention, said 
recombinant vector molecule in step (a)/(aa) (the step identified after the slash refers to 
the corresponding step of the second embodiment of the method of the invention 
identified hereinabove) gives rise to a replicable genetic package (RGP) displaying said 
(poly)peptides at its surface. In this context, the term replicable genetic package (RGP) 
refers to an entity, such as a virus or bacteriophage, which can be replicated following 
infection of a suitable host cell. In the case of bacteriophage, for example, the collection 
of nucleic acid sequences can be inserted into either a phage or phagemid vector in 
frame with a component of the phage coat, such as gene III, resulting in display of the 
encoded binding entities on the surface of the phage. Particularly preferred as a

In a most preferred embodiment of the method of the present invention, said RGP is
encoded by said recombinant vector used in step (a)/(aa) and rendered non-infective 
and infectivity of said RGP is restored by interaction of said (poly)peptide of step (a)/(aa) 
with the (poly)peptide of step (b)/(ab) and/or (c)/(ac), said (poly)peptide of step (b)/(ab) 
and/or (c)/(ac) being fused to a domain that confers infectivity to said RGP. 

In a further most preferred embodiment of the method of the invention, said RGP is 
rendered non-infective by modification of a genetic sequence which encodes a surface 
protein necessary for the RGP's binding to and infection of a host cell. 

These preferred and most preferred embodiments of the method of the present 
invention relating to the infectivity of the RGP serve as an alternative to the use of the 
screenable tag. In these embodiments, advantage can be taken of the phenomenon of 
selective infection (Krebber et al., FEBS Letters 377 (1995) 227-239). While the
screenable tag enables physical separation of molecules from others in the population, 
the use of selective infection enables positive selection for the interacting pair. This 
phenomenon relies on the use of a construct which can selectively restore infectivity to 
phage which have been rendered non-infective by, for example, deletion of all but the 
C-terminus of the gene III protein. Use of such phage for displaying library 1 gives
non-infectious phage carrying the binding entity. Co-expression with library 2 allows
interactions between binding entities and binding partners to be established, as 
described above. Although the phage which carry the binding entity-binding partner pair 
are non-infective, infectivity can be restored if, in place of the screenable tag referred to 
above, an infectivity protein is used. In this context, the term infectivity protein refers to a 
substance which, when associated with the phage, can enable it to penetrate a bacterial 
host, where it is subsequently replicated. An example of an infectivity protein is the N- 
terminus (at least the first 220 amino acids) of gene III protein of the filamentous 
bacteriophage.

The infectivity protein confers on those phage which carry it the ability to be replicated.
Thus, only those phage which carry the binding entity/partner pair are replicated.
Purification of hybrid phage containing genes from both libraries 1 and 2 then relies e.g. 
on the use of two selectable markers as indicated above. The genes in the phage can 
then be identified using methodology well known to those skilled in the art. 

An additional preferred embodiment of the present invention relates to a method, 
wherein said recombinant vector molecules in step (a)/(aa) give rise to a fusion protein 
which is expressed on the surface of a cell, preferably a bacterium. 

These fusion proteins, upon interaction with a suitable binding partner from library 2 
connected e.g. with a screenable tag can be detected on the surface of host cells which 
may be, for example, bacteria, yeast, insect cells or mammalian cells. The display of 
fusion proteins on bacterial surfaces per se is well known in the art. Thus, lipoproteins 
(Lpp), outer membrane proteins A (OmpA), and flagella have been used to target 
antibodies and peptides to the cell surface of E.coli. Fuchs et al., Bio/Technology 9
(1991) 1369-1372, WO93/01287, presented a single chain antibody on the surface of 
E.coli as a fusion protein with the N-terminus of the peptidoglycan-associated 
lipoprotein. The antibody was visualized by the binding of fluorescently labeled antigen 
and fluorescently labeled antibodies directed to the linker peptide of the displayed single 
chain antibody. Francisco et al., Proc. Natl. Acad. Sci. USA 90 (1993) 10444-10448, 
and Georgiou, G. et al., WO93/10214, displayed antibodies on the E.coli surface by
fusing the N-terminus of a single chain antibody to the C-terminus of OmpA while the N- 
terminus of OmpA was fused to the signal sequence and the first nine amino acids of 
Lpp. Binding of a fluorescently labeled antigen to the OmpA-antibody fusion protein was 
detected by FACS. Klauser (WO 95/17509) transferred the IgA protease system from 
Neisseria to E.coli to facilitate display of antibodies. Integration of the beta-domain of 
the IgA protease precursor into the outer membrane led to the transport of the
protease domain across the membrane followed by autoproteolytic release into the
medium. Antibodies linked to the beta-domain of IgA protease are therefore presented 
on the surface of bacteria. Further, Lu, Z. et al., Bio/Technology 12 (1994) 366-371, 
described a system for displaying peptides on the surface of the bacterium by fusing them
to thioredoxin and the bacterial flagella, to screen for peptide mimics of the epitope for 
an anti-IL-8 antibody. 

The further identification of the desired nucleic acid molecule encoding the interacting 
(poly)peptides may then be effected by methods known in the art, e.g. by purifying host 
cells displaying a tag on their surface and further by antibiotic-based selection
techniques, DNA purification and sequencing. 

In a particularly preferred embodiment of the method of the present invention, said 
bacterium is Neisseria gonorrhoeae or E.coli and said fusion protein consists of at least a
part of a flagellum, LamB, peptidoglycan-associated lipoprotein or the OmpA protein
and said (poly)peptide. 

As has been repeatedly pointed out hereinabove, a tag connected to the (poly)peptide 
encoded by library 2 can conveniently be used in the identification strategy of the 
desired nucleic acid sequences. Accordingly, in a further preferred embodiment of the 
method of the invention, said (poly)peptides encoded by said recombinant vector 
molecules of step (b)/(ab) or (c)/(ac) are linked to at least one screenable or selectable 
tag. In this context, the term screenable or selectable tag refers to a short sequence of 
amino acids which can be recognized and bound by a particular substance. Tags are 
commonly used for the purification of biomolecules: examples are His(n), where n = 4-6,
which can be bound either by Ni or by a specific antibody, and the flag and myc tags which
are recognized by appropriate antibodies. In either of these cases, the tag can be 
encoded as a C-terminal fusion to all binding partners in library 2. In accordance with 
the present invention, the tag can be used to isolate e.g. the polyphage referred to
above. Thus, the interaction between the phage-bound binding entity and its interacting
binding partner, establishes a connection between the phage particle and the 
screenable or selectable tag. This feature can be exploited in a step which relies on e.g. 
affinity chromatography to isolate the polyphage carrying the interacting molecules. In a 
final step, those polyphage which carry two distinct nucleic acid molecules and 
preferably genes (encoding binding entity and binding partner) can be separated from 
those carrying only one of the two genes e.g. by selection based on transduction or 
different selectable markers (e.g. antibiotic resistance) present in the individual 
genomes. In this way, the genes which encode the two interacting molecules can be 
identified. 

A most preferred embodiment of the present invention relates to a method wherein said 
screenable or selectable tag is encoded by said recombinant vector of step (b)/(ab) or 
(c)/(ac).

A further most preferred embodiment of the present invention relates to a method 
wherein said screenable or selectable tag is selected from the list His(n), myc, FLAG, 
malE, thioredoxin, GST, streptavidin, beta-galactosidase, alkaline phosphatase, T7 gene
10, Strep-tag and calmodulin. These screenable tags are all well known in the art and 
are fully available to the person skilled in the art. 

In an additional particularly preferred embodiment of the method of the invention, said 
screenable or selectable tag is encoded by the genome of the host cell. 

An example of this embodiment is an anti-Fc-receptor specific antibody that is
expressed by the host cell and could function as an additional bridge in e.g. purification 
by column chromatography. Another example of this embodiment is an enzyme 
produced by the host cell that creates a tag such as a phosphorylation on (poly)peptides 
of the second library without destroying the interaction of (poly)peptides of step (b)/(ab)
with (a)/(aa) so that the modification caused by the enzyme is now the screenable or
selectable tag.

In a further preferred embodiment of the method of the invention, said (poly)peptides 
encoded by the nucleic acid sequences of said additional libraries of step (c)/(ac) cause 
the interaction of said (poly)peptides of steps (a)/(aa) and (b)/(ab) via phosphorylation, 
glycosylation, methylation, lipidation or farnesylation of at least one of said
(poly)peptides of steps (a)/(aa) and (b)/(ab). 

An additional preferred embodiment of the invention relates to a method wherein said 
host cells in step (d)/(a) are spatially addressable, and the nucleic acid sequences 
mentioned in step (g)/(d) are retrieved from the corresponding spatially addressable 
host cell. 

In the context of the present invention, the term "spatially addressable" refers to a 
situation where the individual cells harboring one of the potential combinations of 
members of the first, second and optionally additional libraries are identifiable by their 
relative position, e.g. by their position on a master plate. The screening or selection 
may, for example, be performed either with single clones derived from the master plate, 
or on a replica plate, thus maintaining the connection between the screenable or 
selectable property and the information contained in the host cell on the master plate. 
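
Purely as an illustration of what "spatially addressable" means in practice, the bookkeeping between a replica plate and its master plate may be sketched as follows. The well coordinates and clone identifiers are hypothetical and serve only to show that the position is the sole link needed to retrieve the nucleic acid sequences.

```python
# Hypothetical master plate: well position -> identifiers of the library
# members harboured by the cell at that position (assumed names).
MASTER_PLATE = {
    ("A", 1): ("lib1-017", "lib2-203"),
    ("A", 2): ("lib1-042", "lib2-118"),
    ("B", 1): ("lib1-005", "lib2-077"),
    ("B", 2): ("lib1-090", "lib2-311"),
}

# Hypothetical read-out of a replica plate: positions that show the
# screenable or selectable property.
POSITIVE_ON_REPLICA = [("B", 2), ("A", 2)]


def retrieve_from_master(master_plate, positive_positions):
    """Map positive replica positions back to the clones on the master plate."""
    return {pos: master_plate[pos] for pos in sorted(positive_positions)}


hits = retrieve_from_master(MASTER_PLATE, POSITIVE_ON_REPLICA)
for position, clones in hits.items():
    print(position, clones)
```

Because the screening is done on the replica, the cells on the master plate remain untouched and can be used directly for the identification step.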

An additional preferred embodiment of the invention relates to a method wherein said 
screenable or selectable property is expressed intracellularly. 

Particularly preferred is a method wherein said screenable property is the 
transactivation of the transcription of a reporter gene such as beta-galactosidase, 
alkaline phosphatase or nutritional markers such as his3 and leu or resistance genes
giving resistance to an antibiotic such as ampicillin, chloramphenicol, kanamycin,
zeocin, neomycin, tetracycline, or streptomycin. 

Furthermore, use can be made of the yeast 2-hybrid system referred to hereinabove or 
the interaction trap system (Brent et al., EP-A 0 672 131) or of a prokaryotic version 
analogous to the above recited systems, utilizing the toxR system of Vibrio cholerae 
(Fritz, H.-J. et al., EP-A 0 630 968). It is within the skills of the person skilled in the art to 
combine further screening systems known in the art with the method of the present 
invention. 

In a further preferred method of the present invention, said recombinant vectors of step 
(a)/(aa), (b)/(ab) and (c)/(ac) comprise recombination promoting sites and in said step
(e)/(b) recombination events are selected for, wherein said nucleic acid sequences encoding
said (poly)peptides of step (a)/(aa), said nucleic acid sequences encoding said
(poly)peptides of step (b)/(ab) and optionally said nucleic acid sequences encoding said
(poly)peptides of step (c)/(ac) are contained in the same vector. In this approach, the 
two genes can be coupled in a single vector, and packaged in a phage of standard size, 
if appropriate recombination sites are incorporated in the vectors carrying libraries 1 and 
2. Again, the phage which carry both nucleic acid sequences and genes are purified
with the use of e.g. the screenable tag. If recombination is used to couple the genes
from the two libraries, some of the hybrid progeny phage will contain nonrecombinant
genomes, since site-specific recombination is not very efficient. However, the hybrid
phage can be selected by re-infection of host cells that do not contain library 2 followed 
by another round of selection of the screenable tag. 

In a particularly preferred embodiment of the method of the invention, said 
recombination events are mediated by the site-specific recombination mechanisms Cre-lox,
attP-attB, Mu gin or yeast flp.

In a further particularly preferred embodiment of the method of the invention, said
recombination promoting sites are restriction enzyme recognition sites and said 
recombination event is achieved by cutting the recombinant vector molecules mentioned 
in steps (a)/(aa), (b)/(ab) and optionally (c)/(ac) with at least two different restriction 
enzymes and effecting recombination of the nucleic acid sequences contained in said 
vectors by ligation. 

The invention relates in an additional preferred embodiment to a method wherein said 
identification of said nucleic acid sequences is effected after the selection step (e)/(b) 
via PCR and preferably sequencing of said nucleic acid sequences after said PCR.

After said selection step (e)/(b), PCR can be carried out with the enriched desired
product, conveniently using primers that hybridize to the vector portion of the 
recombinant vector molecule. Sequencing of the PCR-product may then be carried out 
according to conventional methods. 
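
The use of primers hybridizing to the vector portion may be illustrated by the following minimal in-silico sketch. All sequences are invented for the example and do not correspond to any real vector; the function names are likewise hypothetical. The sketch only shows that one primer pair flanking the cloning site recovers whatever insert was selected, without prior knowledge of its sequence.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def amplify(template, fwd_primer, rev_primer):
    """Return the amplicon (primer sites included), or None if a site is missing."""
    start = template.find(fwd_primer)
    if start == -1:
        return None
    rev_site = reverse_complement(rev_primer)
    end = template.find(rev_site, start + len(fwd_primer))
    if end == -1:
        return None
    return template[start:end + len(rev_site)]


def recover_insert(template, fwd_primer, rev_primer):
    """Strip the vector-derived primer sites from the amplicon."""
    amplicon = amplify(template, fwd_primer, rev_primer)
    if amplicon is None:
        return None
    return amplicon[len(fwd_primer):len(amplicon) - len(rev_primer)]


# Invented vector flanks and insert, for illustration only.
VECTOR_FWD_SITE = "ATGCATGC"
VECTOR_REV_SITE = "GGATCCAA"
UNKNOWN_INSERT = "TTTAAACCC"
template = "GGGG" + VECTOR_FWD_SITE + UNKNOWN_INSERT + VECTOR_REV_SITE + "TTTT"

fwd_primer = VECTOR_FWD_SITE
rev_primer = reverse_complement(VECTOR_REV_SITE)  # anneals to the top strand

insert = recover_insert(template, fwd_primer, rev_primer)
print(insert)
```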

In a further preferred embodiment of the method according to the invention, said 
recombinant vectors of step (a)/(aa), (b)/(ab) and/or (c)/(ac) comprise at least one gene 
encoding a selection marker. 

Said genes encoding said selection markers are preferably different in each of the 
vectors of step (a)/(aa), (b)/(ab) and/or (c)/(ac), i.e. said vectors comprise genes 
encoding different selection markers. Said selection markers can conveniently be used 
for the further purification envisaged in step (f)/(c). For example, a polyphage 
comprising two members of each library 1 and 2 can be selected for on the basis of a 
double resistance to antibiotics. Also, a successful recombination event may create a 
new recombinant vector carrying both nucleic acid molecules from library 1 and 2 as 
well as genes encoding different selection markers. Again, the selection for a twofold 
resistance will assist in the identification of the desired product.

In a particularly preferred embodiment of said method, said selection marker is a
resistance to an antibiotic, preferably to ampicillin, chloramphenicol, kanamycin, zeocin, 
neomycin, tetracycline or streptomycin. 
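
The selection for a twofold resistance may be illustrated, by way of example only, with the following sketch. The particle names and their packaged genomes are hypothetical; ampicillin and chloramphenicol are used merely as one possible pair of different selection markers.

```python
# Hypothetical phage particles after enrichment; each particle has packaged
# one or two genomes, and each genome carries one assumed resistance marker.
PARTICLES = {
    "phage-1": [{"library": 1, "resistance": "ampicillin"}],
    "phage-2": [{"library": 2, "resistance": "chloramphenicol"}],
    "phage-3": [
        {"library": 1, "resistance": "ampicillin"},
        {"library": 2, "resistance": "chloramphenicol"},
    ],
}


def confers_resistance(genomes, antibiotics):
    """True if the packaged genomes together confer resistance to every antibiotic."""
    conferred = {genome["resistance"] for genome in genomes}
    return set(antibiotics) <= conferred


def select_double_resistant(particles, antibiotics):
    """Return the particles whose infected host would survive on all antibiotics."""
    return sorted(
        name
        for name, genomes in particles.items()
        if confers_resistance(genomes, antibiotics)
    )


survivors = select_double_resistant(PARTICLES, ("ampicillin", "chloramphenicol"))
print(survivors)
```

Only the particle carrying members of both libraries passes the selection, which is why the markers on the individual vectors are preferably different.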

A further preferred embodiment of the present invention relates to a method wherein 
said host cells are F' and preferably E.coli XL-1 Blue, K91 or its derivatives, TG1,
XL1kan or TOP10F'.

In a particularly preferred embodiment of the present invention, said RGPs are 
produced with the use of helper phage taken from the list R408, M13K07, VCSM13,
M13de13, fCA55 and fKN16 or derivatives thereof.

Further preferred is a method wherein at least one of said genetically diverse nucleic
acid sequences encodes members of the immunoglobulin superfamily.

Said method is particularly preferred, if said genetically diverse nucleic acid sequences 
encode a repertoire of immunoglobulin heavy or light chains. 

In an additional preferred embodiment of the present invention, in said method said 
genetically diverse nucleic acid sequences are generated by a mutagenesis method. 
Various mutagenesis methods are well known to the person skilled in the art and need 
not be described here in any further detail.

The present invention relates in an additional preferred embodiment to a method in 
which said genetically diverse nucleic acid sequences are generated from a cDNA
library. 

In a final preferred embodiment of the method of the invention, said nucleic acid 
sequences are genes or parts thereof.

As used herein, the term "parts thereof" relates to parts of genes that encode a product
that is capable of interacting with a product encoded by any of the other libraries. Thus,
it is well known that various proteins are comprised of different domains. Only one of
said domains may be capable of interacting with a different (poly)peptide. Such a 
domain might be encoded by a part of said gene in accordance with the present 
invention. 

The invention also provides for identifying genes encoding more than two interacting 
peptides or proteins. This can be achieved by using additional vectors encoding 
genetically diverse additional nucleic acids by an extension of the method described 
above. As previously, the presence of either a screenable tag or an infectivity protein is 
used to purify phage carrying genes which encode the components of the complex. 
Again, the genes in the phage can then be sequenced using methodology well known to 
those skilled in the art. 

Additionally, the present invention relates to a kit comprising at least 

(a) a recombinant vector molecule as described in step (a)/(aa) or a corresponding 
vector molecule; 

(b) a recombinant vector molecule as described in step (b)/(ab) or a corresponding 
vector molecule; and, optionally, 

(c) at least one further recombinant vector molecule as described in step (c)/(ac) or a 
corresponding vector molecule. 

As a rule, if recombinant vector molecules are comprised in said kit, they will comprise a 
library of nucleic acid molecules. In other words, the kit of the invention will contain a
plurality of different recombinant vector molecules.

Legends to Figures and Tables

Figure 1: General description of the polyphage principle

a) transform to E. coli hosts 

b) infect host containing library1 with helper-phage to package library1
into phage

c) infect cells containing library2 with phages containing library1 leading to
cells harboring members of library1 and library2; the presence of library1
and library2 is selected by the presence of the 2 antibiotic resistance
markers

d) expression of library1 and library2-tag gene products

e) infect cells with engineered helper-phage to induce polyphage 
production 

Note 1: Polyphage does not discriminate which genome to package 
therefore the possibilities resulting from step e) arise in an infected cell. To 
select for the polyphage containing the right packaged genomes the 
subsequent step is required 

f) select for tag e.g., infectivity-mediating protein, in which case ability to 
infect is selected and 

g) select for ability to confer resistance to 2 antibiotics to infected ceils 
Note 2: Only polyphages that satisfy f) + g) represent phages that display 
the correct interacting pair and the corresponding genetic information 

Figure 2: Co-transformation of two phagemids, polyphage formation and selection 
via His-tag: general description 

A, B: libraries of phagemids, preferably with different resistance markers; 
A: fusions to gIIIp; B: fusions to tag (His); after co-transformation, phage 
production leading to a phage population displaying cognate pairs (left 
part of the Figure) or not (right part); after selection, infection of host cells, 
selection for double-resistance. 
Alternative methods include the infection of cells harbouring a plasmid- or 
phagemid-based library B with a phage library A (prerequisite again: 
interference-resistant constructs). 

Figure 3: pBS vector series: functional map and sequence of pBS13 

Figure 4: Co-existence of phagemids: results of restriction digest 

Restriction analysis of clones of double resistances (Amp/Cm). R1: 
pIG10.3, XbaI/ScaI; R2: pBS13, XbaI/ScaI; R1+R2: R1 and R2 are mixed in 
approx. equal proportion; M1: marker λ: BstEII; M2: marker pBR322: MspI; 
1 to 10: randomly picked clones: XbaI/ScaI 

Figure 5: Phagemid vector pYING1-C1: functional map 

containing the fos peptide. The corresponding vectors pYING1-C2 and 
pYING1-C3 contain instead of fos the p75 and the IL16 peptides, 
respectively 

Figure 6: Phagemid vector pYANG3-A: functional map 

containing the jun peptide. The corresponding vectors pYANG3-Ape2, 
pYANG3-Ape3, and pYANG3-Ape10 contain instead of jun the p75- 
binding peptides pe2, pe3, and pe10, respectively 
Figure 7: Analysis of selected clones (see Table 2): 

7.a: Restriction digest of clones before and after selection 
R: pYANG3-Ape2: XbaI; M1: marker λ: BstEII; M2: marker pBR322: MspI; 
α/1 to 10: randomly picked clones before selection: XbaI/HindIII; β/1 to 10: 
randomly picked clones after selection: XbaI/HindIII; sizes expected: jun- 
gIII: 745 bp; fos: 256 bp; p75: 577 bp; IL-16: 502 bp 

7.b: PCR reaction of clones after selection with primers OPEP5L and 
OGIII3 
R1: pYANG3-A as template; R2: pYANG3-Ape2 as template; M: marker λ: 
BstEII; β/1 to 10: randomly picked clones after selection as templates 

Figure 8: Phagemid vector pING1-C1: functional map 

containing the His-tag peptide. The corresponding vector pING3-C1 
contains an additional FLAG epitope; pING1-C2 and pING3-C2 contain 
the Strep-tag instead of His-tag, with pING3-C2 containing an additional 
FLAG epitope. 

Figure 9: Phagemid vector pONG3-A: functional map 

for the generation of phage-display libraries (gIII fusions) 

Figure 10: Co-transformation of phage and plasmid, polyphage formation and 
selection via SIP: general description 

A: library A in phage construct; B: library B, library members fused to IMP; 
preferably different resistance markers on phage and plasmid; after co- 
transformation, production of phages; in the case of cognate-pair 
interaction, formation of infectious phages; selection; by plating on double- 
resistance, identification of polyphage particles. 

Figure 11: Phage vector fhag1A: functional map 

for phage-display of the anti-HAG scFv 

Figure 11a: CAT gene module: functional map and sequence 

Figure 12: Phage vector fjun1A: functional map 

for phage-display of the jun peptide 

Figure 13: Phage vector fjun1B: functional map 

for phage-display of the jun peptide 

Figure 14: Phage vector fpep3_1B: functional map 

for phage-display of the peptide pe3 binding to the intracellular domain of 
p75 

Figure 15: Phage vector fNGF_1B: functional map 

for phage-display of NGF 

Figure 16: Plasmid pUC19/IMPhag: functional map 

containing fusion of HAG peptide to the N-terminal domains of gIIIp (IMP) 

Figure 17: Plasmid pUC18/IMPp75: functional map 

containing fusion of the intracellular domain of p75 to the N-terminal 
domains of gIIIp (IMP); pUC18/IMPfos contains the fos peptide instead of 
the intracellular domain of p75 

Figure 18: Plasmid pUC18/IMPIL16: functional map 

containing fusion of IL16 to the N-terminal domains of gIIIp (IMP) 

Figure 19: Analysis of selected clones (see Table 3) 

Lane 1: marker λ: BstEII; lanes 2 to 20: polyphage transductant clones #1 
to #19 digested with XbaI/HindIII; f.._1b: fragment of phage vector after 
digest; pUC18: fragment of plasmid after digest; a-HAG: fragment 
containing anti-HAG scFv fused to gIIIc; IMP-p75 and IMP-HAG: fragment 
containing IMP fused to p75, and IMP-HAG peptide, respectively; pep3- 
gIIIs: fragment containing pep3 fused to gIIIc (s: short version) 

Figure 20: Co-transformation of phagemids, in vivo recombination and selection via 
His-tag: general description 

A, B: libraries of phagemids; preferably with different resistance markers; 
A: fusions to gIIIp; B: fusions to tag (His); both constructs containing 
recombination-promoting sites (*) such as lox/loxP; after co-transformation 
and recombination, production of phages; selection via Ni-NTA; re-infection 
of host cells, selection for double-resistance 

Figure 21: In vitro recombination and selection via His-tag: general description 

A, B: libraries of phagemids; preferably with different resistance markers; 
A: fusions to gIIIp; B: fusions to tag (His); both constructs containing 
corresponding recognition sites for restriction enzymes (+/o); after digest 
and co-ligation, transformation and production of phages; selection via Ni- 
NTA; re-infection of host cells, selection for double-resistance 

Figure 22: Phage vector fjunhag: functional map for phage display of the jun peptide. 

Figure 23: Spatial in vivo SIP: general description 

After transformation or co-transformation according to any of the methods 
described above, a master plate is made. From that, phages secreted from 
individual clones can be analyzed individually (top), or a replica (migration 
of secreted phages through filter disc) can be made whereon selection for 
the presence of a tag or infectivity can be performed. By going back to the 
master-plate, the information for selected cognate interacting pairs can be 
retrieved without requiring recombination and/or polyphage production. 

Figure 24: E. coli display: general description 

A, B: libraries of phagemids; preferably with different resistance markers; 
A: fusions to E. coli surface-display protein; B: fusions to tag (His); after co- 
transformation, expression of constructs; surface-display; in the case of 
cognate interaction taking place, display of tag on the surface of the host 
cell; selection 

Figure 25: pTERMsc2H10myc3sCAM: functional map and sequence 

Table 1: Phagemids constructed for Experiments 2 and 3 

Table 2: Results of Experiment 2 (see Figure 7) 

2.a: Combination of phagemids present in initial library (α) 

2.b: Combination of phagemids present after selection (β) 

Table 3: Results of Experiment 4 (see Figure 19) 

3.a: Identification of phage/plasmid present in individual clones 

3.b: Test for infectivity of individual clones 

The examples illustrate the invention. 





WO 97/32017 


PCT/EP97/00931 



Example 1: General description of the polyphage principle (Figure 1) 


The binding entities which comprise library 1 may be peptides or proteins, and are 
encoded by a genetically diverse collection of first nucleic acid sequences. These 
nucleic acid sequences are inserted into a first vector which allows for display of the 
encoded binding entities on the surface of a replicable genetic package. For the 
purposes of subsequent selection, the first vector should also carry a gene encoding a 
selectable marker, such as an antibiotic resistance. The binding partners which 
comprise library 2 may be peptides or proteins, and are encoded by a genetically 
diverse collection of second nucleic acid sequences which are inserted into a second 
vector. By way of example, this second vector may be a plasmid, or even a phage or 
phagemid, in which case the origin of replication should be distinct from that of the first 
vector. For the purposes of subsequent selection, the second vector should also carry a 
gene encoding a selectable marker, such as an antibiotic resistance, preferably distinct 
from that present in the first vector. To facilitate purification of the complex to be formed 
between any binding entity-binding partner pair, a screenable tag can be conveniently 
attached to members of library 2. 

The two genetically diverse collections of nucleic acids are then introduced into a 
population of host cells in such a way that encoded libraries 1 and 2 can be expressed. 
This can be achieved by either (i) co-transformation of the two vectors, or, as actually 
shown in the figure, (ii) packaging one of the collections of nucleic acids into a vector 
(such as a bacteriophage) which can be used to infect with high efficiency a population 
of cells into which the complementary collection of nucleic acid has been introduced. 
The result is a population of cells in which individual cells carry representatives of each 
library. 

Expression of the two collections of nucleic acids results in the production of pairs of 
molecules, one from each library, in the host cells. In each case, one or more members 
of the library of binding entities is incorporated into the coat of an RGP. In some cells, 
an interaction will be established between a binding partner on the surface of the RGP 
and a binding partner expressed from library 2. When such an interaction is established, 
the RGP therefore carries both the binding entity and the binding partner. 

The RGPs displaying such an interaction can then be further purified with the help of 
polyphage and differing selection markers, as has been discussed hereinabove. After 
such selection, the nucleic acid sequences encoding one or both binding partners can 
be conveniently identified by methodology known in the art, such as DNA sequencing. 

Example 2: Co-transformation of phagemids with same E. coli origin 
of replication, polyphage formation, and selection of correct pairing 
interactions via His-tag 

2.1: Principle (see Figure 2) 

To demonstrate that polyphage formation allows the retrieval of the genetic information 
for cognate protein pairs selected using a tag fused to one member of the protein pair, 
two separate, small libraries in phagemid vectors are constructed. 

2.2: Test of co-existence of phagemids with the same E. coli origin of replication: 
A prerequisite for the formation of polyphage particles containing two different phagemids 
is that the different phagemid vectors can co-exist in the host cell. 

The vector pBS13 is a derivative of the vector (Krebber et al., 1996) containing a 
chloramphenicol-resistance gene instead of the kanamycin-resistance gene and a beta- 
lactamase gene cassette instead of the 2H10-gIII fusion gene, and can be assembled 
by standard methods starting from pto2H10a3s. Figure 3 contains the functional map 
and the sequence of pBS13. pIGhag1A (see Example 4.2.1.f) is digested with XbaI 
and HindIII. The 1.3 kb fragment containing the anti-HAG gene fused with the C- 
terminal domain of filamentous phage pIII protein is isolated and ligated with the pre- 
digested phagemid vectors pIG10.3 and pBS13 (XbaI-HindIII) to create the vectors 
pIG10.3-scFv(anti-HAG) (ApR) and pBS13-scFv(anti-HAG) (CmR), respectively. The 
vectors are used to transform competent XL-1 Blue cells and selected on LB plates 
containing Amp/Cm/Tet and glucose (20 mM). 

The phagemids from clones of double-resistant colonies (Amp/Cm) are isolated. The 
restriction digestions indicate the co-isolation of both phagemids from the single 
colonies (Figure 4). 

2.3: Design of libraries A and B: 

Library A contains three cyclic peptides each binding to the intracellular domain of the 
low affinity nerve growth factor (NGF) receptor (see Example 4), and a leucine zipper 
domain derived from the jun transcription factor, all N-terminally fused to the C-terminal 
domain of gIII from filamentous phage. 

Library B encodes 3 members, namely the leucine zipper domain of the fos 
transcription factor which heterodimerizes with jun via this domain, the intracellular 
domain of the NGF receptor p75, and, as a negative control which does not interact with 
library A members, IL-16, all fused at the N-terminus with a His6-peptide as tag (Hochuli 
et al., 1988; Lindner et al., 1992). 

The cognate pairings are from the interaction between jun and fos (Crameri and Suter, 
1993), and p75 and selected cyclic peptides (see Example 4). A non-cognate pairing 
would occur among the non-cognate pairs mentioned and among jun, or one of the 
cyclic peptides, and IL-16. 
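
The pairing scheme of section 2.3 can be enumerated as a small sketch. The member names follow the text and the figure legends (pe2, pe3, pe10, jun; fos, p75, IL-16); the function name `classify_pairs` and the data layout are illustrative assumptions and not part of the described method.

```python
from itertools import product

# Library members as named in section 2.3 and the figure legends.
LIBRARY_A = ["pe2", "pe3", "pe10", "jun"]  # fused to the C-terminal domain of gIII
LIBRARY_B = ["fos", "p75", "IL-16"]        # fused to the His tag

# Cognate pairs stated in the text: jun/fos, and each cyclic peptide with p75.
COGNATE = {("jun", "fos"), ("pe2", "p75"), ("pe3", "p75"), ("pe10", "p75")}


def classify_pairs(lib_a, lib_b, cognate):
    """Split all A x B combinations into cognate and non-cognate pairs."""
    hits, misses = [], []
    for pair in product(lib_a, lib_b):
        (hits if pair in cognate else misses).append(pair)
    return hits, misses


hits, misses = classify_pairs(LIBRARY_A, LIBRARY_B, COGNATE)
print(len(hits), "cognate pairs;", len(misses), "non-cognate pairs")
```

Of the twelve possible combinations, only the four cognate ones are expected to be enriched by the His-tag selection; every combination involving IL-16 falls into the non-cognate group.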

2.4: PCR amplification of the individual constructs 

Fos, N-terminally fused to His6, is PCR amplified using pOKI (Gramatikoff et al., 1994) 
as template and oligonucleotides OFOS-5 and OFOS-3 as primers, where His6 is 
encoded in the OFOS-5 primer. Jun is PCR amplified using pOKI as template and 
oligonucleotides OJUN-5 and OJUN-3 as primers. 

OFOS-5 5'- GGGGATATCCACCACCACCACCACCACCTGCGGTGGTCTGACC 

OFOS-3 5'- GGGGAATTCCAACCACCGTGTGCCG 

OJUN-5 5'- GGGGATATCGGTGGTCGGATCGCC 

OJUN-3 5'- GGGGAATTCACCACCGTGGTTCATGAC 

The hot-start procedure is used. A step-wise touch-down PCR is applied: 92°C, 1 min; 
58-52°C, ΔT = 2°C, 1 min; 72°C, 1 min. This is followed by 26 cycles (92°C, 1 min; 52°C, 
1 min; 72°C, 1 min). 
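
The cycling scheme above can be written out as an explicit list of steps. The sketch below assumes one cycle per 2°C annealing step (58, 56, 54, 52°C) during the touch-down phase, which the text does not state explicitly; the function name and the (temperature, seconds) tuple layout are illustrative only.

```python
def touchdown_program(start=58, stop=52, step=2, plateau_cycles=26,
                      denature=(92, 60), extend=(72, 60), anneal_seconds=60):
    """Return a list of cycles; each cycle is a list of (deg C, seconds) steps.

    Assumption: exactly one cycle is run at each annealing temperature
    from `start` down to `stop`, followed by `plateau_cycles` at `stop`.
    """
    cycles = []
    temp = start
    while temp >= stop:  # touch-down phase
        cycles.append([denature, (temp, anneal_seconds), extend])
        temp -= step
    for _ in range(plateau_cycles):  # constant-temperature phase
        cycles.append([denature, (stop, anneal_seconds), extend])
    return cycles


program = touchdown_program()
annealing_temps = [cycle[1][0] for cycle in program]
print(len(program), "cycles; annealing temperatures:", annealing_temps[:5], "...")
```

Under this reading the program comprises 30 cycles in total; a thermocycler that repeats each touch-down temperature more than once would need a correspondingly adapted loop.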

The PCR products are purified using the QIAquick kit (Qiagen) and eluted in ddH2O. They 
are then digested overnight with EcoRI and EcoRV. 

The p75 fragment is also PCR amplified using pUC18-IMPp75 (see Example 4) as 
template and oligonucleotides OP75-5 (where His6 is encoded) and OP75-3 as primers: 

OP75-5 5'- GGGGATATCCACCACCACCACCACCACAAGAGGTGGAACAGC 

OP75-3 5'- GGGGAATTCCACTGGGGATGTGGCAG 

The same PCR and restriction digestion conditions as above are applied. 

The IL-16 fragment is amplified from the cDNA clone pcDNA3-ILHu1 (M. Baier, Paul 
Ehrlich Institute, Germany; Baier et al., 1995; Bannert et al., 1996), using OIL16-5 
(where His6 is encoded) and OIL16-3 as primers. 

OIL16-5 5'- GGGGATATCCACCACCACCACCACCACCCCGACCTCAACTCCTC 

OIL16-3 5'- GGGGAATTCGGAGTCTCCAGCAGCTG 

The same PCR and restriction digestion conditions as above are applied. 

In all cases, the fragments are readily amplified and digested. 
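
As a plausibility check on the primer design of section 2.4, the following sketch scans each primer for the EcoRV and EcoRI recognition sequences and for six consecutive CAC codons. The primer sequences are reproduced with the restriction sites read as GATATC and GAATTC, consistent with the EcoRI/EcoRV digest described above; the helper name `annotate` is an illustrative assumption.

```python
SITES = {"EcoRV": "GATATC", "EcoRI": "GAATTC"}
HIS6 = "CAC" * 6  # six histidine codons, as present in the 5' primers

PRIMERS = {
    "OFOS-5": "GGGGATATCCACCACCACCACCACCACCTGCGGTGGTCTGACC",
    "OFOS-3": "GGGGAATTCCAACCACCGTGTGCCG",
    "OJUN-5": "GGGGATATCGGTGGTCGGATCGCC",
    "OJUN-3": "GGGGAATTCACCACCGTGGTTCATGAC",
    "OP75-5": "GGGGATATCCACCACCACCACCACCACAAGAGGTGGAACAGC",
    "OP75-3": "GGGGAATTCCACTGGGGATGTGGCAG",
    "OIL16-5": "GGGGATATCCACCACCACCACCACCACCCCGACCTCAACTCCTC",
    "OIL16-3": "GGGGAATTCGGAGTCTCCAGCAGCTG",
}


def annotate(seq):
    """Report which restriction sites a primer carries and whether it encodes His6."""
    return {
        "sites": sorted(name for name, site in SITES.items() if site in seq),
        "his6": HIS6 in seq,
    }


report = {name: annotate(seq) for name, seq in PRIMERS.items()}
for name, info in report.items():
    print(name, info)
```

In this reading every 5' primer carries an EcoRV site and every 3' primer an EcoRI site, and the His6 codons occur only in the 5' primers for fos, p75 and IL-16, in agreement with the design of library B.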


2.5: Cloning into intermediate vectors 

The phage particles are harvested from the overnight cultures, mixed and PEG- 
precipitated. The phages are directly selected on immobilized Ni-NTA (Ni-NTA HisSorb 
Strips, Qiagen). The eluted phages are used to infect TG1 cells, which are plated on 
ampicillin/chloramphenicol-containing LB agar plates, and colonies with double 
resistance (ApR/CmR) are selected. 

The phagemids of selected clones are isolated and analyzed by restriction digest (see 
Figure 7.a) and used as templates for PCR screening. Primer OPEP5L is used to 
amplify the pYANG3-Ape2, -Ape3 and -Ape10 constructs specifically (see Figure 7.b). 

OPEP5L 5'- GACTACAAAGATGTCGACTG 

There is a specific enrichment of constructs of correct pairing (Table 2). 


Example 3: Interactive screening of E. coli genomic DNA libraries 
(Polyphage/tag system) 

3.1: Principle (see Figure 2) 

Instead of using two model libraries as in Example 2, a genomic DNA library of E. coli is 
prepared to be screened against itself to identify interacting E. coli peptides or proteins. 

3.2: Construction of display and expression vectors for genomic DNA 

Expression vectors are constructed having a blunt-end restriction site SmaI inserted 
either in front of His-tag, Strep-tag (Schmidt and Skerra, 1994) or the C-terminal domain 
of gIII (gIIIc) via oligonucleotide cassettes or PCR. 

The complementary oligonucleotides OHIS5 & OHIS3, and OSTREP5 & OSTREP3, 
are used to create ds DNA cassettes encoding the His-tag, and the Strep-tag, 
respectively. 

OHIS5 5'- AATTCCCCGGGCACCACCACCACCACCACTGATA 

OHIS3 5'- AGCTTATCAGTGGTGGTGGTGGTGGTGCCCGGGG 

OSTREP5 5'- AATTCCCCGGGTCTGCTTGGCGTCACCCGCAGTTCGGTGGTTGATA 

OSTREP3 5'- AGCTTATCAACCACCGAACTGCGGGTGACGCCAAGCAGACCCGGGG 

The cassettes upon phosphorylation and annealing recreate the EcoRI and HindIII sites. 
The cassettes are inserted into pIG1 and pIG3 vectors (Ge et al., 1995) cut by the same 
restriction enzymes. The resulting vectors are pING1-A1, pING3-A1 (for His tag in pIG1 
and pIG3 vectors) and pING1-A2, pING3-A2 (for Strep-tag), respectively. The correct 
vectors are screened for the presence of the XmaI site (isoschizomer of SmaI) and the 
constructs are confirmed by sequencing. The XbaI/HindIII fragments of these vectors 
are inserted into pBS13 vector, linearized with the same enzymes, resulting in vectors 
pING1-C1, pING3-C1 and pING1-C2, pING3-C2, respectively (see Figure 8). 
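
The design of the two oligonucleotide cassettes can be checked with a short sketch: after removal of the four-base 5' overhangs (AATT for the EcoRI-compatible end, AGCT for the HindIII-compatible end), the two strands of each cassette should be exact reverse complements, and the top strand should contain the SmaI/XmaI site CCCGGG followed by the tag codons. The reading frame used for translation (starting directly at CCCGGG) is an assumption inferred from the sequences, and the codon table is deliberately limited to the codons that occur in these cassettes.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Partial codon table: only the codons occurring in the two cassettes.
CODONS = {"CCC": "P", "GGG": "G", "CAC": "H", "TCT": "S", "GCT": "A", "TGG": "W",
          "CGT": "R", "CCG": "P", "CAG": "Q", "TTC": "F", "GGT": "G", "TGA": "*"}

OHIS5 = "AATTCCCCGGGCACCACCACCACCACCACTGATA"
OHIS3 = "AGCTTATCAGTGGTGGTGGTGGTGGTGCCCGGGG"
OSTREP5 = "AATTCCCCGGGTCTGCTTGGCGTCACCCGCAGTTCGGTGGTTGATA"
OSTREP3 = "AGCTTATCAACCACCGAACTGCGGGTGACGCCAAGCAGACCCGGGG"


def revcomp(seq):
    """Reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def anneals(top, bottom):
    """True if the strands pair perfectly once the 4-nt 5' overhangs are removed."""
    return (top.startswith("AATT") and bottom.startswith("AGCT")
            and revcomp(top[4:]) == bottom[4:])


def translate(seq):
    """Translate codon by codon until a stop codon or an incomplete codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODONS[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


his_tag = translate(OHIS5[5:])      # assumed frame: starts at the SmaI site CCCGGG
strep_tag = translate(OSTREP5[5:])
print(anneals(OHIS5, OHIS3), his_tag)
print(anneals(OSTREP5, OSTREP3), strep_tag)
```

In the assumed frame the SmaI site is read as Pro-Gly, followed by six histidines in the His cassette and by the sequence SAWRHPQFGG in the Strep cassette, each terminated by a TGA stop codon.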

The gIIIc fragment containing the SmaI site is generated from PCR amplification of 
pIG10.3 vector using primers OGIII5 and OGIII3, where OGIII3 anneals 3' of the gene III 
in the vector. 

OGIII5 5'- CGGAATTCCCCGGGGAGCAGAAGCTGATC 

OGIII3 5'- TTTTTCACTTCACAGGTC 

Three rounds of PCR are performed with a hot-start: 92°C, 1 min; 46°C, 1 min; 72°C, 1.5 
min. This is followed by 30 rounds of: 92°C, 1 min; 50°C, 1 min; 72°C, 1.5 min. 

The PCR product is purified (QIAquick) and digested with EcoRI and HindIII. The 
fragment is gel-purified (QIAquick) and ligated into pIG10.3. The sequence of the 
resulting vector, pONG3-A (see Figure 8), is confirmed by restriction analysis and by 
sequencing. 

3.3: Selection of Interacting Pairs from E. coli Genomic DNA via His-tag 

Genomic DNA of E. coli strain XL-1 Blue (Stratagene) is isolated using the Blood & Cell 
Culture DNA Maxi kit (Qiagen) and eluted in TE buffer (pH 8.0). 200 µg of the DNA is 
taken and sonicated (50 cycles, 270 mA, 0.5 s/stroke). The fragmented DNA (average 
size: max. 0.7 kb) is blunt-ended by a fill-in reaction with T4 DNA polymerase. 

Vectors pING1-C1 and pONG3-A are digested with EcoRV and SmaI, and the vector 
fragments are gel-purified (Qiagen). The vector fragments are then ligated with the 
blunt-ended genomic DNA at 16°C overnight. The ligation mixtures are used to 
transform TG1 cells. 

The pING1-C1 and pONG3-A transformants are scraped from the plate and used to 
inoculate 2xYT medium containing Cm/glucose or Amp/glucose, respectively. The 
pING1-C1 culture is infected with helper-phage (VCSM13 or M13K07) and phage 
particles are isolated. These phage particles are used to infect log-phase cells 
containing the pONG3-A library. The resulting culture is plated out on large 
Amp/Cm/glucose plates. 

The colonies are scraped from the surface of the plates above and transferred to 2xYT 
medium containing Amp/Cm. After 30 min shaking at 37°C, the culture is induced 
(1 mM IPTG) for 30 min, infected with helper-phage at 37°C for 30 min and shaken at 
RT overnight. 

The phage particles are harvested from the overnight culture and PEG-precipitated. 
They are selected on immobilized Ni-NTA (Ni-NTA HisSorb Strips, Qiagen). The eluted 
phages are used to infect log-phase TG1 cells. Selected protein pairs are characterised 
by determination of their corresponding DNA sequences. 

Example 4: Polyphages and Selection of Correct Pairing Interactions 
via SIP 

4.1: Principle (see Figure 10) 

The purpose of this experiment is to show that from a combination of 2 libraries one can 
isolate and identify the correct interacting pairs using the SIP (Selectively Infective 
Phage: Krebber et al., 1995; the term "IMP" used in the experimental section denotes 
"infectivity mediating particle" comprising the N-terminal domains of the gene III protein 
of filamentous phage) selection system, and recover the information about both 
interacting partners via the formation and selection of polyphage particles. The library 
members forming interacting pairs with members of the corresponding library are being 
'doped' with library members that do not interact with members of the corresponding 
library, and thus should not give a positive SIP selection. 

4.2: Construction of vectors 
4.2.1: fhag1A (see Figure 11) 

a. The phage vector f17/9-hag (Krebber et al., 1995) is digested with EcoRV and XmnI. 
The 1.1 kb fragment containing the anti-HAG Ab gene is isolated by agarose gel 
electrophoresis and purified with a Qiagen gel extraction kit. This fragment is ligated 
into a pre-digested pIG10.3 vector (EcoRV-XmnI). Ligated DNA is transformed into 
DH5α cells and positive clones are verified by restriction analysis. The recombinant 
clone is called pIGhag1A. All cloning steps described above and subsequently are 
performed according to standard protocols (Sambrook et al., 1989). 

b. The vector f17/9-hag (Krebber et al., 1995) is digested with EcoRV and StuI. The 7.9 
kb fragment is isolated and self-ligated to form the vector fhag2. 

c. The chloramphenicol resistance gene (CAT), assembled via assembly PCR (Ge and 
Rudolph, 1997) using the template pACYC (Cardoso and Schwarz, 1992) (Figure 
11a shows the functional map and the sequence of the CAT gene), is amplified by the 
polymerase chain reaction (PCR) with the primers: 

CAT_BspEI(for): 5' GAATGCTCATCCGGAGTTC 

CAT_Bsu36I(rev): 5' TTTCACTGGCCTCAGGCTAGCACCAGGCGTTTAAG 

d. The PCR is done following standard protocols (Sambrook et al., 1989). The amplified 
product is digested with BspEI and Bsu36I, then ligated into pre-digested fhag2 vector 
(BspEI-Bsu36I; 7.2 kb fragment) to form fhag2C. 

e. The vector fhag2C is digested with EcoRI and the ends made blunt by filling-in with 
Klenow fragment. The flushed vector is self-ligated to form vector fhag2CdelEcoRI. 

f. pIGhag1A is digested with XbaI and HindIII. The 1.3 kb fragment containing the anti- 
HAG gene fused with the C-terminal domain of filamentous phage pIII protein is 
isolated and ligated with a pre-digested fhag2CdelEcoRI phage vector (XbaI-HindIII; 
6.4 kb) to create the vector fhag1A. 

4.2.2: fjun1A (see Figure 12) 

a. The EcoRV site of pIG10.3 is converted to a SalI site by oligonucleotide site-directed 
mutagenesis (Sambrook et al., 1989) with primer 

Sall9-9primer(rev) 5'CTGAATGTCGACATCTTTGTAGTC3' 

The mutated pIG10.3 is called pIG10.3SalI. 

b. The jun leucine-zipper domain from pOKI (Gramatikoff et al., 1994) is amplified by 
PCR with the primers: 

jun2(for): 5'ACGCGTCGACGCCGGTGGTCGGATCGCCCGG3' 
jun2(rev): 5'AATTCGGCACCACCGTGGTTCATGACT3' 

c. The PCR is done following standard protocols (Sambrook et al., 1989). The amplified 
product is digested with SalI and EcoRI, then ligated into pre-digested pIG10.3SalI 
vector (SalI-EcoRI) to form the vector jun-pIG10.3SalI. 

d. The vector jun-pIG10.3SalI is digested with XbaI and EcoRI. The 0.14 kb fragment is 
ligated into the pre-digested vector fhag1A (XbaI-EcoRI; 7 kb) to form the vector 
fjun1A. 

4.2.3: fjun1B (see Figure 13) 

a. The DNA encoding the C-terminal domain including the long linker separating it from 
the amino terminal domain of the filamentous phage pIII (gIII short) is amplified by 
PCR using pOKI (Gramatikoff et al., 1994) as template with the primers: 

gIII short(for): 5'GCTTCCGGAGAATTCAATGCTGGCGGCGGCTCT3' 

gIII short(rev): 5'CCCCCCCAAGCTTATCAAGACTCCTTATTACG3' 

b. The PCR is done following standard protocols (Sambrook et al., 1989). The amplified 
product is digested with EcoRI and HindIII, then ligated into pre-digested fhag1A 
vector (EcoRI-HindIII) to form the vector fjun1B. 

4.2.4: fpep2_1B, fpep3_1B, fpep10_1B (see Figure 14) 

a. These constructs are obtained from a peptide library screened against the 
intracellular domain of p75, the low affinity receptor of NGF, in a SIP experiment. 

b. A peptide library cassette of cyclic peptides with length variants of 6-16 amino acids 
is prepared from the oligos: 

Groprim: 5'-CATGAATTCGGATCCTCC-3' 

Gron10: 5'-CTATGGCGCGCCTGTCGACTGT(M)6-16TGTGGTGGTGGAGGATCCGAATTCATG-3' 

where M is a mixture of 19 trinucleotide codons (Virnekäs et al., 1994), excluding the 
one coding for Cys. The length variation is achieved by coupling 6 trinucleotide 
positions using the standard coupling procedure, and, for the next 10 coupling cycles, 
by omitting the capping step during DNA synthesis and by diluting the trinucleotide 
mixture to achieve stepwise coupling yields of 50%. 
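
The expected length distribution of the randomized segment can be estimated with a simple model. The sketch below assumes that, in each of the 10 uncapped cycles, every growing chain is extended independently with a probability equal to the stated coupling yield of 50%, so that the number of additional codons follows a binomial distribution. This is an idealization introduced here for illustration; the function name and parameters are not taken from the source.

```python
from math import comb


def length_distribution(fixed=6, extra_cycles=10, coupling_yield=0.5):
    """Probability of each loop length (in codons) under a binomial model.

    `fixed` codons are always present; each of the `extra_cycles` uncapped
    cycles adds one codon with probability `coupling_yield` (idealized).
    """
    p = coupling_yield
    return {
        fixed + k: comb(extra_cycles, k) * p**k * (1 - p)**(extra_cycles - k)
        for k in range(extra_cycles + 1)
    }


dist = length_distribution()
for length, prob in dist.items():
    print(f"{length:2d} codons: {prob:.4f}")
```

Under this model the lengths range from 6 to 16 codons and cluster around 11 codons, while the shortest and longest variants each make up roughly 0.1% of the synthesized oligonucleotides.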

The oligos are annealed and filled in with the Klenow fragment of DNA polymerase I 
to form a double-stranded DNA cassette with standard methods (Sambrook et al., 
1989). The cassette is digested with SalI-EcoRI, purified with Qiaex DNA gel 
extraction kit, and ligated to pre-digested fjun1B vector (SalI-EcoRI) to form the 
peptide library. The ligated peptide library is transformed into competent DH5α cells 
harboring pUC18/IMP-p75 (see below) and plated on Luria Broth (LB) (30 µg/ml 
chloramphenicol + 100 µg/ml ampicillin) and incubated overnight at ambient 
temperature. 

c. The Ampr Cmr colonies are scraped with LB, and 1 ml of suspension is used to 
inoculate 25 ml LB (30 µg/ml chloramphenicol + 100 µg/ml ampicillin + 1 mM IPTG). 
The culture is incubated overnight at room temperature. 

a. The vector f17/9-hag (Krebber et al., 1995) is digested with EcoRI and HindIII. The 
1.4 kb fragment containing the gene fusion of the IMP with the HAG peptide is 
isolated and cloned into pre-digested pUC19 (EcoRI-HindIII) to form the vector 
pUC19/IMP-HAG. 


4.2.7: pUC18/IMP-p75 (see Figure 17) 

a. The intracellular domain of p75 containing the C-terminal 142 amino acids is 
amplified from the cDNA clone of p75 (Chao et al., 1986) as template with the 
primers: 

p75(for): 5’ GCTGGCCCGTACGACAAGAGGTGGAACAGCTGC 
p75(rev): 5’ TCTCGAAGCTTATCACACTGGGGATGTGGC 

b. The PCR is done following standard protocols (Sambrook et al., 1989). The amplified 
product is digested with BsiWI and HindIII, then ligated into pre-digested pUC19 
vector (BsiWI-HindIII) to form the vector pUC19/IMP-p75. 

c. The vector pUC19/IMP-p75 is digested with XbaI and HindIII. The 1 kb fragment is 
isolated and cloned into the pre-digested pUC18 vector (XbaI-HindIII) to form the 
vector pUC18/IMP-p75. 

4.2.8: pUC18/IMP-IL16 (see Figure 18) 

a. The IL16 gene is amplified from the clone pcDNA3-ILHu1 (M. Baier, Paul Ehrlich 
Institute, Germany; Baier et al., 1995; Bannert et al., 1996) as template with the 
primers: 

f1Bsu36Ifor: 5'AGACTGCCTCAGGCCAGCCCGACCTCAACTCC3' 
f3HindIIIrev2: 5'ATATATAAGCTTTTAGGAGTCTCCAGCAGC3' 

b. The PCR is done following standard protocols (Sambrook et al., 1989). The amplified 
product is digested with Bsu36I and HindIII, then ligated into pre-digested 
pUC18/IMP-p75 vector (Bsu36I-HindIII) to form the vector pUC18/IMP-IL16. 


4.3: In vivo SIP with co-transformation and polyphage 

4.3.1: Combining 2 libraries (Library 1 is fused with gIII while Library 2 is fused to the 
IMP). 

10 ng each of fjun1B, fjun1A, fpep3_1B, fhag1A, fNGF_1B with 500 ng each of 
pUC18/IMP-p75, pUC18/IMP-HAG, pUC18/IMP-IL16 are co-transformed into DH5α 
cells by electroporation. The cells are plated on Luria Broth (LB) (30 µg/ml 
chloramphenicol + 100 µg/ml ampicillin) and incubated overnight at ambient 
temperature. 

The Ampr Cmr colonies are scraped with LB and 1 ml of suspension is used to inoculate 
25 ml LB (30 µg/ml chloramphenicol + 100 µg/ml ampicillin + 1 mM IPTG) followed by 
incubation overnight at room temperature. 

4.3.2: In vivo SIP. The supernatant from the cells is separated by centrifugation (10,000 
RPM, 10 min, 4°C). 5 ml of 30% PEG/3 M NaCl are added to the supernatant and mixed 
100 times. After 1 hour on ice, the phage precipitate is collected by centrifugation 
(10,000 RPM, 10 min, 4°C). The pellet is resuspended in 1 ml TBS buffer, and the 
suspension is filtered through a 0.45 micron filter (Sartorius). 

200 µl of phage supernatant are used to infect 1.8 ml of log phase K91 cells (or any 
male E. coli cells (F-pilus containing)), and the cells are plated on LB (30 µg/ml 
chloramphenicol + 100 µg/ml ampicillin) and incubated overnight at ambient 
temperature. 

4.3.3: Testing of infectious polyphage DNA patterns and infectivity. Twenty individual 
Ampr Cmr colonies are used to inoculate 5 ml LB (30 µg/ml chloramphenicol + 100 
µg/ml ampicillin) in each case and incubated at ambient temperature overnight. Plasmid 
and RF DNA are isolated from each clone with a Qiagen Miniprep DNA kit. Clones are 
analysed by restriction analysis with restriction enzymes XbaI and HindIII together with 
appropriate buffers as supplied and instructed by the manufacturer. The restriction 

To be able to retrieve the genetic information for cognate protein pairs selected via a tag 
fused to one of the partners, two separate libraries in phagemid vectors are constructed 
containing the lox recombination promoting sites and recombined on one phagemid by 
action of the cre recombinase in an in vivo recombination. 

5.2: Vector construction 

Both loxP and loxP511 sites (Hoess et al., 1986) are inserted in tandem into the region 
flanked by the ColE1 ori and β-lactamase in vector pING1-C1, whereas in vector 
pONG3-A, the loxP site is cloned upstream of the XbaI site and the loxP511 
downstream of the HindIII site. Therefore, the genomic DNAs to be cloned are flanked 
by the loxP and loxP511 sites. 

5.3: Library construction and recombination 

The libraries are prepared as in Example 3. The phagemids in the double-resistant 
clones are recombined through the cre recombinase, which either is encoded in the 
phagemid and is inducible (Tsurushita et al., 1996), or is transferred through P1 phage 
infection (Rosner, 1972; Waterhouse et al., 1993). Phages are prepared from the 
recombined clones by helper phage infection and used to infect new E. coli cells (cre-). 

5.4: Selection 

The phage particles are prepared from the CmR clones and subjected to His-tag 
selection as in Examples 2 and 3. The sequences encoded in each phagemid, which 
now contains members of both libraries, can be determined by sequencing using 
primers specific for myc-tag region (library 1) and His-tag region (library 2). 

Example 6: SIP-based library vs. library screening via in vitro 
recombination of separately constructed libraries into one phage 
vector 

6.1: Principle (see Figure 21) 

To be able to retrieve the genetic information for cognate protein pairs selected by SIP 
interaction in vivo, two separate libraries in phage and plasmid vectors are constructed 
and recombined by co-ligation in an in vitro recombination. 

6.2: Construction of Libraries A and B 

Library A encodes 2 members, namely a single chain Fv antibody against a peptide
derived from hemagglutinin (fahag) and the leucine zipper domain derived from the jun
transcription factor (fjun), both N-terminally fused to the C-terminal domain of gIII from
filamentous phage and preceded by the ompA signal sequence followed by the Flag
epitope.

Library B encodes 3 members on plasmid vectors of the pUC series, namely the
hemagglutinin peptide to which the above ahag antibody binds (pUC19-IMPhag), the
leucine zipper domain of the fos transcription factor (pUC18-IMPfos), which
heterodimerizes with jun via this domain, and the intracellular domain of the low affinity
nerve growth factor receptor (pUC18-IMPp75) as a negative control which does not
interact with library A members, all fused to the infectivity-mediating N-terminal domains
of phage gIII protein, preceded by the gIII signal sequence.

Library A members are cloned into a fd phage vector which also contains, downstream
of the library A insertion site, the N-terminal domains (N1-N2) of gIII, followed by the
cloning sites BsiWI and HindIII to allow in-frame insertion of library B members.

Library A construct fahag is identical to the f17/9-hag fd phage vector (Krebber et al.,
1995) and serves as the basis for construction of fjun. The jun leucine zipper together with
amino acids 290 to 326 of the C-terminal part of gIII is PCR-amplified (primers FR620
and FR621, containing EcoRV and SfiI sites, respectively) from the construct fjunIB
(containing the jun leucine zipper fused to amino acids 290 to 493 of gIII) generated in
Example 4. The resulting PCR fragment is ligated directionally into EcoRV/SfiI-digested
f17/9-hag vector in frame with amino acids 327 to 493 of the gIII C-terminal domain,
resulting in vector fjunhag (see Figure 22).

Generation of library B constructs pUC19-IMPhag and pUC18-IMPp75 is described in
Example 4. To construct pUC18-IMPfos, amino acids 219 to 272 of the N-terminal part
of gIII together with the fos leucine zipper are PCR-amplified (primers FR618 and
FR619, containing BsiWI and HindIII sites, respectively) from the pOKI phagemid
vector (Grammatikoff et al., 1994). The resulting PCR fragment is ligated directionally
into BsiWI/HindIII-digested pUC18-IMPp75 to create pUC18-IMPfos (see Figure 17).
Primers: 

FR618: 5’CGCCGTACGGCGGCTCTGGTGGTGGTTCTGGTGGC3’ 

FR619: 5’CCCAAGCTTTTAGACTAGCTGACTAGAAGATCTGC3’ 

FR620: 5’CGCGATATCGTCGACGCCGGTGGTCGGATCGCC3’ 

FR621: 5’CGCGGCCCCCGAGGCCCCACCACCGGAACCGCCTCCC3' 
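
As a quick consistency check on the primers listed above, the following Python sketch searches each primer for the recognition sequence of the restriction enzyme it is stated to carry. The recognition sequences used here are the standard ones for BsiWI, HindIII, EcoRV and SfiI and are not given in the text; the dictionary and function names are illustrative assumptions.

```python
import re

# Primer sequences as printed in the text (5' to 3').
PRIMERS = {
    "FR618": "CGCCGTACGGCGGCTCTGGTGGTGGTTCTGGTGGC",
    "FR619": "CCCAAGCTTTTAGACTAGCTGACTAGAAGATCTGC",
    "FR620": "CGCGATATCGTCGACGCCGGTGGTCGGATCGCC",
    "FR621": "CGCGGCCCCCGAGGCCCCACCACCGGAACCGCCTCCC",
}

# Standard recognition sequences (assumption: not stated in the text).
# SfiI recognises GGCCNNNNNGGCC, written here as a regular expression.
SITE_PATTERNS = {
    "BsiWI": "CGTACG",
    "HindIII": "AAGCTT",
    "EcoRV": "GATATC",
    "SfiI": "GGCC[ACGT]{5}GGCC",
}

# Which site each primer is said to contain, according to the text.
EXPECTED_SITE = {
    "FR618": "BsiWI",
    "FR619": "HindIII",
    "FR620": "EcoRV",
    "FR621": "SfiI",
}


def find_site(primer_seq, enzyme):
    """Return the 0-based start of the enzyme site in the primer, or None."""
    match = re.search(SITE_PATTERNS[enzyme], primer_seq)
    return match.start() if match else None


def check_primers():
    """Map each primer name to the position of its expected site (or None)."""
    return {
        name: find_site(PRIMERS[name], enzyme)
        for name, enzyme in EXPECTED_SITE.items()
    }


if __name__ == "__main__":
    for name, position in check_primers().items():
        print(name, EXPECTED_SITE[name], position)
```

A result of None for any primer would point to a transcription error in the printed sequence or to a mismatch between the primer and the enzyme named for it.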

6.3: Preparation and recombination of library A and B and selection of interacting 
protein pairs by SIP 

Non-covalent, cognate interactions of ahag antibody with hag peptide (Krebber et al.,
1995) and of fos and jun leucine zipper domains (Grammatikoff et al., 1994) generate
infective SIP phage. Thus, from the six possible combinations of members of the model
libraries A and B (fahag-hag, fahag-fos, fahag-p75, fjun-fos, fjun-hag, fjun-p75), only
the two cognate combinations (fahag-hag and fjun-fos) should be selected by in vivo SIP. To
recombine the library members in all possible permutations, library A is linearized by
digestion with BsiWI/HindIII to prepare it for random incorporation of library B members,
prepared by mass-excision with BsiWI/HindIII from the construct B pool described
above. After co-ligation of the mass-excised library B fragments into library A vectors,
the sample is transformed into competent E. coli cells, plated onto
chloramphenicol-containing LB agar plates and grown overnight at 37°C. The recombined library size can
be determined by plating serial dilutions of the transformation and can be compared to
the complexities of the individual libraries A and B. The total recombined library is
scraped from the plates in LB medium and used to inoculate an appropriate volume of
chloramphenicol-selective LB medium supplemented with 1 mM IPTG. After growth at
30°C overnight with constant shaking to allow production of SIP phages, the bacteria
are pelleted by centrifugation and phages present in the supernatant are precipitated on
ice for one hour by addition of 0.25 volumes of 20% PEG/2.5 M NaCl. The phages are
pelleted by centrifugation for 30 min at 10 000 x g and 4°C. The pellet is resuspended in
an appropriate volume of 1 x TBS buffer and filtered through a 0.45 µm filter. Serial
dilutions of this filtrate are used to infect F+ E. coli cells. The double-stranded, replicative
form phage DNA is prepared from resulting transductant colonies by standard methods 
and analyzed by restriction digest and sequencing for the presence and identity of 
library A and B members. Furthermore, the supernatant of transductant colonies is 
analyzed for the presence of infective SIP phages to confirm that protein-protein 
interaction of a particular pair selected from the recombined libraries A and B is 
responsible for SIP phage infectivity. 
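
The two quantitative steps in this procedure, the PEG/NaCl precipitation and the estimate of library size from serial dilutions, reduce to simple arithmetic. The Python sketch below is illustrative only. The function names and the colony count in the usage lines are hypothetical, and the concentration calculation assumes that the culture supernatant itself contributes no PEG or NaCl.

```python
def final_concentration(stock_conc, stock_volumes, sample_volumes=1.0):
    """Concentration of a stock component after mixing stock and sample.

    Assumes the sample contributes none of the component (a simplification).
    """
    return stock_conc * stock_volumes / (sample_volumes + stock_volumes)


def library_size(colonies, dilution_factor, plated_ml, total_ml):
    """Estimate total transformants from one countable dilution plate."""
    cfu_per_ml = colonies * dilution_factor / plated_ml
    return cfu_per_ml * total_ml


# 0.25 volumes of 20% PEG / 2.5 M NaCl added to 1 volume of supernatant.
peg_percent = final_concentration(20.0, 0.25)
nacl_molar = final_concentration(2.5, 0.25)

# Hypothetical count (not from the text): 150 colonies from 0.1 ml of a
# 1:1000 dilution of a 1 ml transformation.
estimated_size = library_size(150, 1000, 0.1, 1.0)

if __name__ == "__main__":
    print(peg_percent, nacl_molar, estimated_size)
```

Under these assumptions the phages are precipitated at about 4% PEG and 0.5 M NaCl. The estimated size of the recombined library can then be compared with the product of the complexities of libraries A and B to judge how completely the combinations are sampled.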

Alternatively, the model libraries A (2 members) and B (3 members) are used to 
construct all possible combinations (listed above) individually, and equal amounts (50 
ng) of each of the 6 combinations can be co-transformed into competent E. coli cells 
followed by the steps listed above. The distribution of individual constructs after co- 
transformation as well as the distribution of transductants resulting from the model 
library can be analyzed as described above. The selective recovery of phage constructs 
which co-encode cognate protein pairs demonstrates the feasibility of SIP-based 
selection of binding partners after an appropriate recombination event.
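
The six combinations of the model libraries, and the two that are expected to survive selection, can be enumerated as follows. This is a minimal sketch: the construct names are taken from the text, the set of cognate pairs encodes the interactions stated there (ahag antibody with hag peptide, jun with fos), and the function name is an assumption.

```python
from itertools import product

LIBRARY_A = ["fahag", "fjun"]        # 2 members
LIBRARY_B = ["hag", "fos", "p75"]    # 3 members, p75 is the negative control

# Pairs expected to restore infectivity, as stated in the text.
COGNATE_PAIRS = {("fahag", "hag"), ("fjun", "fos")}


def classify_pairs(lib_a, lib_b, cognate):
    """Return {(a, b): True/False} for every A x B combination."""
    return {(a, b): (a, b) in cognate for a, b in product(lib_a, lib_b)}


if __name__ == "__main__":
    for (a, b), selected in classify_pairs(LIBRARY_A, LIBRARY_B, COGNATE_PAIRS).items():
        print(f"{a}-{b}: {'cognate' if selected else 'non-cognate'}")
```

For larger libraries the same product grows multiplicatively, which is why the recombined library size is compared with the complexities of the individual libraries.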

Example 7: 'Spatial' in vivo SIP 

7.1: Principle (see Figure 23) 

Coupling of information about members of interacting peptides or proteins is achieved
by having a spatial relationship between the particles displaying the selectable or
screenable property (in this example phages for the SIP experiment) and the package
containing the genetic information for the individual library members (in this example the
E. coli cell secreting the phage particle being screened), i.e. a correlation between the
phage being examined and the position of the corresponding E. coli host on the master
plate.

7.2: Combining 2 libraries (Library A is fused with gIII while library B is fused to
the IMP)

10 ng each of fjunIB, fjunIA, fpep3_1B, fhagIA and fNGFIB are co-transformed with 500
ng each of pUC18/IMP-p75, pUC19/IMP-HAG and pUC18/IMP-IL16 into DH5α cells by
electroporation. The transformants are plated on LB (30 µg/ml chloramphenicol + 100
µg/ml ampicillin) and incubated overnight at ambient temperature.

7.3: Screening of co-transformants by SIP 

From the master plate of co-transformants, each co-transformant is labelled
and inoculated separately into 5 ml LB (30 µg/ml chloramphenicol + 100 µg/ml
ampicillin) and incubated overnight at ambient temperature.

Plasmid and RF DNA are isolated from each clone with a Qiagen Miniprep DNA kit.
Clones are analysed by restriction analysis with restriction enzymes XbaI and HindIII
together with appropriate buffers as supplied and instructed by the manufacturer. The
restriction digests are run in a 0.8% TBE agarose gel at constant voltage of 100 V for 1 
to 2 hours. Restriction patterns allow discrimination of the particular clones. 

The supernatant from the overnight culture of the individual clones is filtered with a 0.45
micron filter (Sartorius). Ten microliters of phage supernatant are mixed with 100 µl of
log phase K91 cells (or any male, i.e. F-pilus containing, E. coli cells) for 10 minutes at
37°C. The suspension is plated on LB (30 µg/ml chloramphenicol) and incubated
overnight at 37°C.

A positive co-transformant (i.e. one that contains the correct interacting pair) has a
corresponding correct restriction pattern and is capable of producing infectious phages
that are incapable of secondary or subsequent infections. Polyphage particles, which are
capable of such infections and also contain the genetic information of an interacting pair,
can readily be identified by their restriction digest pattern.
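
The screening criteria of this example can be summarised as a small decision rule. The Python sketch below is one possible reading of the paragraph above, not a procedure given in the text: the function name, the three boolean inputs and the label strings are all assumptions made for illustration.

```python
def classify_clone(correct_pattern, infectious, secondary_infection):
    """Label a co-transformant from three observations.

    correct_pattern:     restriction digest matches an interacting pair
    infectious:          culture supernatant transduces male E. coli cells
    secondary_infection: phages from transductants can infect again
    """
    if not (correct_pattern and infectious):
        return "negative"
    if secondary_infection:
        # Capable of further infection while carrying both partners:
        # treated here as a polyphage particle.
        return "polyphage"
    return "positive"


if __name__ == "__main__":
    print(classify_clone(True, True, False))
    print(classify_clone(True, True, True))
    print(classify_clone(False, True, False))
```

Because each clone keeps its position on the master plate, a clone labelled in this way can be traced back to the E. coli host that carries both library members.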

Example 8: E. coli display 
8.1: Principle (see Figure 24) 

Two libraries are introduced into E. coli cells, with expressed members of library A (such
as antibody, peptide, or cDNA libraries) being presented at the surface of the cells. In
those cases where interacting pairs are formed, members of library B (such as antibody,
peptide, or cDNA libraries) are transported in complex with their cognate partners to the
surface of the cell as well, thus displaying a selectable or screenable property such as a
tag. Selected cells contain the information for both interacting partners.

8.2: Preparation of Library A 

A thioredoxin peptide library is prepared as fusions to the E. coli flagellin in the pFLITRX 
vector essentially as described (Lu et al., 1995).

8.3: Preparation of Library B 

A cyclic, variable-length peptide library including a FLAG epitope (Hopp et al., 1988;
Knappik and Plückthun, 1994) is prepared essentially as described in Example 4.2.4,
and cloned in the pTERM vector, a modified version of the pto2H10a3s vector (Krebber
et al., 1996) containing a chloramphenicol-resistance gene instead of the
kanamycin-resistance gene. The pTERM vector can be assembled by standard methods starting
from pto2H10a3s. This cyclic peptide library is packaged by infection with a helper
phage (M13K07 or VCSM13) by standard methods (Sambrook et al., 1989).

8.4: Combination of Library A and Library B

An aliquot of the E. coli cells containing Library A is used to inoculate 50 ml LB (100
µg/ml ampicillin) and incubated at ambient temperature until the OD600 reaches 0.4.
The cells are infected with phages containing Library B at a multiplicity of infection (MOI)
of 10. After 30 min of infection, the cells are collected by centrifugation (5000 RPM, 10
minutes, 4°C) and resuspended in 1 ml LB. The suspension is plated on M9 media (+ 1
mM MgCl2, supplemented with 0.5% glucose, 0.2% casamino acids, 100 µg/ml
ampicillin, 30 µg/ml chloramphenicol).
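
The number of phage particles needed for infection at an MOI of 10 follows from the culture volume and cell density. The Python sketch below is a rough estimate only: the conversion of 8e8 cells per ml per OD600 unit is a commonly quoted rule of thumb for E. coli and is an assumption not stated in the text, so it should be replaced by a value calibrated for the strain and spectrophotometer in use. The function name is likewise hypothetical.

```python
def phage_needed(od600, culture_ml, moi, cells_per_ml_per_od=8e8):
    """Phage particles required to infect a culture at the given MOI.

    cells_per_ml_per_od is an assumed conversion factor, not a value
    taken from the text.
    """
    total_cells = od600 * cells_per_ml_per_od * culture_ml
    return moi * total_cells


if __name__ == "__main__":
    # 50 ml culture at OD600 = 0.4, infected at MOI = 10 (values from the text).
    print(f"{phage_needed(0.4, 50, 10):.2e} phage particles")
```

An MOI well above 1 makes it likely that nearly every Library A cell receives at least one Library B phagemid, at the cost of some cells receiving more than one.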

8.5: Selection of interacting pairs

The Ampr Cmr colonies are scraped with M9 media (+ 1 mM MgCl2, supplemented with
0.5% glucose, 0.2% casamino acids, 100 µg/ml ampicillin, 30 µg/ml chloramphenicol),
and an aliquot of the suspension is used to inoculate 25 ml M9 media (+ 1 mM MgCl2,
supplemented with 0.5% glucose, 0.2% casamino acids, 100 µg/ml ampicillin, 30 µg/ml
chloramphenicol) and incubated at 37°C until saturation. Selection is performed
essentially as described (Lu et al., 1995), the modification being that the antibody used
for selection is the M1 anti-FLAG antibody (Kodak).

Individual enriched Ampr Cmr colonies are isolated and the sequences of the
corresponding interacting peptide(s) and cyclic peptide(s) are determined by DNA
sequencing. To confirm that the encoded peptide and cyclic peptide form a cognate
pair, each of the clones is tested for enrichment based on the selection method
described above, whereby the Ampr Cmr colonies bind to the M1 anti-FLAG antibody in
a single round of selection.

1. A method for identifying a plurality of nucleic acid sequences, said nucleic acid
sequences each encoding a (poly)peptide capable of interacting with at least 
one further (poly)peptide encoded by a different member of said plurality of 
nucleic acid sequences, comprising the steps of: 

(a) providing a first library of recombinant vector molecules containing 
genetically diverse nucleic acid sequences comprising a variety of nucleic 
acid sequences encoding (poly)peptides; 

(b) providing a second library of recombinant vector molecules containing 
genetically diverse nucleic acid sequences comprising a variety of nucleic 
acid sequences encoding (poly)peptides capable of interacting with further 
(poly)peptides as mentioned in step (a), wherein the vector molecules 
employed for the production of said recombinant vector molecules and/or 
the recombinant inserts display properties that are phenotypically 
distinguishable from those of the vector molecules and/or the recombinant 
inserts used in step (a) and wherein at least one of said properties 
displayed by each of said vector molecules and/or the recombinant inserts 
used in steps (a) and (b), upon the interaction of a (poly)peptide from said
first library with a (poly)peptide from said second library together generate
a screenable or selectable property; 

(c) optionally, providing additional libraries of recombinant vector molecules 
containing genetically diverse nucleic acid sequences comprising a variety 
of nucleic acid sequences encoding (poly)peptides capable of interacting
with or causing interaction of (a) further (poly)peptide(s) as mentioned in
step (a) and/or step (b), wherein the vector molecules employed for the
production of said recombinant vector molecules and/or the recombinant
inserts display properties that are phenotypically distinguishable from 
those of the vector molecules and/or the recombinant inserts used in steps 
(a) and (b) and, optionally, at least one of said properties displayed by said 
vector molecule and/or the recombinant inserts used in step (c) together 
with at least one of said properties displayed by either said vector 
molecule and/or said recombinant inserts used in steps (a) and/or (b), 
upon the interaction of a (poly)peptide from said additional library with 
either a (poly)peptide from said first library and/or a (poly)peptide from said 
second library generate a screenable or selectable property;

(d) expressing members of said libraries of recombinant vectors or nucleic 
acid sequences mentioned in steps (a), (b) and optionally (c), in
appropriate host cells so that at least one interaction is established; 

(e) selecting for the generation of said screenable or selectable property 
representing the interaction of said (poly)peptides; 

(f) optionally, carrying out further screening, selection and/or purification 
steps; and 

(g) identifying said nucleic acid sequences encoding said (poly)peptides. 

2. A method for identifying a plurality of nucleic acid sequences, said nucleic acid
sequences each encoding a (poly)peptide capable of interacting with at least
one further (poly)peptide encoded by a different member of said plurality of
nucleic acid sequences, comprising the steps of:

(a) expressing in appropriate host cells 

(aa) nucleic acid sequences contained in a first library of recombinant 
vector molecules containing genetically diverse nucleic acid 
sequences comprising a variety of nucleic acid sequences encoding 
(poly)peptides;

(ab) nucleic acid sequences contained in a second library of recombinant
vector molecules containing genetically diverse nucleic acid
sequences comprising a variety of nucleic acid sequences encoding 
(poly)peptides capable of interacting with further (poly)peptides as 
mentioned in step (aa), wherein the vector molecules employed for 
the production of said recombinant vector molecules and/or the 
recombinant inserts display properties that are phenotypically 
distinguishable from those of the vector molecules and/or the 
recombinant inserts used in step (aa) and wherein at least one of 
said properties displayed by each of said vector molecules and/or the 
recombinant inserts used in steps (aa) and (ab), upon the interaction 
of a (poly)peptide from said first library with a (poly)peptide from said
second library together generate a screenable or selectable property;

(ac) optionally, nucleic acid sequences contained in additional libraries of 
recombinant vector molecules containing genetically diverse nucleic 
acid sequences comprising a variety of nucleic acid sequences 
encoding (poly)peptides capable of interacting with or causing 
interaction of (a) further (poly)peptide(s) as mentioned in step (aa) 
and/or step (ab), wherein the vector molecules employed for the 
production of said recombinant vector molecules and/or the 
recombinant inserts display properties that are phenotypically 
distinguishable from those of the vector molecules and/or the 
recombinant inserts used in steps (aa) and (ab) and, optionally, at 
least one of said properties displayed by said vector molecule and/or 
the recombinant inserts used in step (ac) together with at least one of 
said properties displayed by either said vector molecule and/or said 
recombinant inserts used in steps (aa) and/or (ab), upon the
interaction of a (poly)peptide from said additional library with either a
(poly)peptide from said first library and/or a (poly)peptide from said 
second library generate a screenable or selectable property; 

so that at least one interaction is established; 

(b) selecting for the generation of said screenable or selectable property 
representing the interaction of said (poly)peptides; 

(c) optionally, carrying out further screening, selection and/or purification 
steps; and 

(d) identifying said nucleic acid sequences encoding said (poly)peptides. 

3. The method according to claim 1 or 2, wherein said screenable or selectable
property is expressed extracellularly.

4. The method according to any one of claims 1 to 3 wherein said recombinant
vector molecules in step (a)/(aa) give rise to a replicable genetic package
(RGP) displaying said (poly)peptides at its surface.

5. The method according to claim 4, wherein said recombinant vector molecule is
a recombinant phage, phagemid or virus.

6. The method according to claim 5, wherein said phage is

(a) one of the class I phage fd, M13, If, Ike, ZJ/2, Ff;

(b) one of the class II phage Xf, Pf1, and Pf3;

(c) one of the lambdoid phages, lambda, 434, P1;

(d) one of the class of enveloped phages, PRD1; or

(e) one of the class paramyxo-viruses, orthomyxo-viruses, baculo-viruses, 
retro-viruses, reo-viruses and alpha-viruses.

7. The method according to any one of claims 4 to 6, wherein said selection step
(e)/(b) is carried out by selecting polyphage comprising the interacting 
(poly)peptides. 

8. The method according to any one of claims 4 to 7, wherein said screenable or
selectable property is connected to the infectivity of said RGP. 

9. The method according to claim 8, wherein said RGP is encoded by said 
recombinant vector used in step (a)/(aa) and rendered non-infective and 
infectivity of said RGP is restored by interaction of said (poly)peptide of step 
(a)/(aa) with the (poly)peptide of step (b)/(ab) and/or (c)/(ac), said (poly)peptide 
of step (b)/(ab) and/or (c)/(ac) being fused to a domain that confers infectivity to 
said RGP. 

10. The method according to claim 9, wherein said RGP is rendered non-infective 
by modification of a genetic sequence which encodes a surface protein 
necessary for the RGP’s binding to and infection of a host cell. 

11. The method according to any one of claims 1 to 3, wherein said recombinant
vector molecules in step (a)/(aa) give rise to a fusion protein which is expressed 
on the surface of a cell, preferably a bacterium. 

12. The method according to claim 11, wherein said bacterium is Neisseria
gonorrhoeae or E. coli and said fusion protein consists of at least a part of a
flagellum, LamB, peptidoglycan-associated lipoprotein or the OmpA protein
and said (poly)peptide.

13. The method according to any one of claims 3 to 7, 11 or 12, wherein said
(poly)peptides encoded by said recombinant vector molecules of step (b)/(ab) 
or (c)/(ac) are linked to at least one screenable or selectable tag. 

14. The method according to claim 13, wherein said screenable or selectable tag is 
encoded by said recombinant vector of step (b)/(ab) or (c)/(ac). 

15. The method according to claim 13 or 14, wherein said screenable or selectable 
tag is selected from the list His(n), myc, FLAG, malE, thioredoxin, GST, 
streptavidin, beta-galactosidase, alkaline phosphatase, T7 gene 10, Strep-tag 
and calmodulin. 

16. The method according to claim 13, wherein said screenable or selectable tag is 
encoded by the genome of the host cell. 

17. The method according to any one of claims 1 to 16, wherein said (poly)peptides 
encoded by the nucleic acid sequences of said additional libraries of step 
(c)/(ac) cause the interaction of said (poly)peptides of steps (a)/(aa) and (b)/(ab) 
via phosphorylation, glycosylation, methylation, lipidation or farnesylation of at
least one of said (poly)peptides of steps (a)/(aa) and (b)/(ab). 

18. The method according to any of claims 1 to 10 and 13 to 17, wherein said host 
cells in step (d)/(a) are spatially addressable, and the nucleic acid sequences 
mentioned in step (g)/(d) are retrieved from the corresponding spatially 
addressable host cell. 

19. The method according to claim 1 or 2, wherein said screenable or selectable 
property is expressed intracellularly.



WO 97/32017 PCT/EP97/00931 


20. The method according to claim 19, wherein said screenable or selectable
property is the transactivation of transcription of a reporter gene such as
beta-galactosidase, alkaline phosphatase or nutritional markers such as his3 and
leu, or resistance genes giving resistance to an antibiotic such as ampicillin,
chloramphenicol, kanamycin, zeocin, neomycin, tetracycline or streptomycin.

21. The method according to any one of claims 1 to 20, wherein said recombinant 
vectors of step (a)/(aa), (b)/(ab) and (c)/(ac) comprise recombination promoting 
sites and in said step (e)/(b) recombination events are selected for, wherein 
said nucleic acid sequences encoding said (poly)peptides of step (a)/(aa), said 
nucleic acid sequences encoding said (poly)peptides of step (b)/(ab) and 
optionally said nucleic acid sequences encoding said (poly)peptides of step 
(c)/(ac) are contained in the same vector. 

22. The method according to claim 21, wherein said recombination events are
mediated by the site-specific recombination mechanisms Cre-lox, attP-attB, Mu 
gin or yeast flp. 

23. The method according to claim 21, wherein said recombination promoting sites
are restriction enzyme recognition sites and said recombination event is
achieved by cutting the recombinant vector molecules mentioned in step
(a)/(aa), (b)/(ab) and optionally (c)/(ac) with at least two different restriction
enzymes and effecting recombination of the nucleic acid sequences contained 
in said vectors by ligation. 

24. The method according to any one of claims 1 to 23 wherein said identification 
of said nucleic acid sequences is effected after the selection of step (e)/(b) via 
PCR and preferably sequencing of said nucleic acid sequences after said PCR.

25. The method according to any one of claims 1 to 24, wherein said recombinant
vectors of step (a)/(aa), (b)/(ab) and/or (c)/(ac) comprise at least one gene
encoding a selection marker. 

26. The method according to claim 25, wherein said selection marker is a 
resistance to an antibiotic, preferably to ampicillin, chloramphenicol, kanamycin, 
zeocin, neomycin, tetracycline or streptomycin. 

27. The method according to any one of claims 1 to 26, wherein said host cells are
F and preferably E. coli XL-1 Blue, K91 or its derivatives, TG1, XLIkan or
TOP10F'.

28. The method according to any one of claims 3 to 18 and 21 to 27, wherein said 
RGPs are produced with the use of helper phage taken from the list R408, 
M13K07 and VCSM13, M13de13, fCA55 and fKN16 or derivatives thereof.

29. The method according to any of claims 1 to 28, wherein at least one of said 
genetically diverse nucleic acid sequences encodes members of the
immunoglobulin superfamily. 

30. The method according to claim 29, wherein said genetically diverse nucleic acid 
sequences encode a repertoire of immunoglobulin heavy or light chains. 

31. The method according to any of claims 1 to 30, in which said genetically diverse 
nucleic acid sequences are generated by a mutagenesis method. 

32. The method according to any of claims 1 to 31, in which said genetically diverse 
nucleic acid sequences are generated from a cDNA library.

33. The method according to any one of claims 1 to 32 wherein said nucleic acid
sequences are genes or parts thereof.

34. Kit comprising at least 

(a) a recombinant vector molecule as described in step (a)/(aa) or a 
corresponding vector molecule; 

(b) a recombinant vector molecule as described in step (b)/(ab) or a 
corresponding vector molecule; and, optionally, 

(c) at least one further recombinant vector molecule as described in step 
(c)/(ac) or a corresponding vector molecule.

Figure 1: General description of the polyphage principle

Figure 2: Co-transformation of two phagemids, polyphage
formation and selection via His-tag: general description

Figure 3: pBS vector series: functional map and sequence of
pBS13


1 ACCCGACACC ATCGAATGGC GCAAAACCTT TCGCGGTATG GCATGATAGC 

TGGGCTGTGG TAGCTTACCG CGTTTTGGAA AGCGCCATAC CGTACTATCG 

51 GCCCGGAAGA GAGTCAATTC AGGGTGGTGA ATGTGAAACC AGTAACGTTA

CGGGCCTTCT CTCAGTTAAG TCCCACCACT TACACTTTGG TCATTGCAAT 

101 TACGATGTCG CAGAGTATGC CGGTGTCTCT TATCAGACCG TTTCCCGCGT 

ATGCTACAGC GTCTCATACG GCCACAGAGA ATAGTCTGGC AAAGGGCGCA 

151 GGTGAACCAG GCCAGCCACG TTTCTGCGAA AACGCGGGAA AAAGTGGAAG 

CCACTTGGTC CGGTCGGTGC AAAGACGCTT TTGCGCCCTT TTTCACCTTC 

201 CGGCGATGGC GGAGCTGAAT TACATTCCCA ACCGCGTGGC ACAACAACTG 

GCCGCTACCG CCTCGACTTA ATGTAAGGGT TGGCGCACCG TGTTGTTGAC 

251 GCGGGCAAAC AGTCGTTGCT GATTGGCGTT GCCACCTCCA GTCTGGCCCT 

CGCCCGTTTG TCAGCAACGA CTAACCGCAA CGGTGGAGGT CAGACCGGGA 

301 GCACGCGCCG TCGCAAATTG TCGCGGCGAT TAAATCTCGC GCCGATCAAC 

CGTGCGCGGC AGCGTTTAAC AGCGCCGCTA ATTTAGAGCG CGGCTAGTTG 

351 TGGGTGCCAG CGTGGTGGTG TCGATGGTAG AACGAAGCGG CGTCGAAGCC 

ACCCACGGTC GCACCACCAC AGCTACCATC TTGCTTCGCC GCAGCTTCGG 

401 TGTAAAGCGG CGGTGCACAA TCTTCTCGCG CAACGCGTCA GTGGGCTGAT 

ACATTTCGCC GCCACGTGTT AGAAGAGCGC GTTGCGCAGT CACCCGACTA 

451 CATTAACTAT CCGCTGGATG ACCAGGATGC CATTGCTGTG GAAGCTGCCT 

GTAATTGATA GGCGACCTAC TGGTCCTACG GTAACGACAC CTTCGACGGA 

501 GCACTAATGT TCCGGCGTTA TTTCTTGATG TCTCTGACCA GACACCCATC 

CGTGATTACA AGGCCGCAAT AAAGAACTAC AGAGACTGGT CTGTGGGTAG 

551 AACAGTATTA TTTTCTCCCA TGAAGACGGT ACGCGACTGG GCGTGGAGCA 

TTGTCATAAT AAAAGAGGGT ACTTCTGCCA TGCGCTGACC CGCACCTCGT 

601 TCTGGTCGCA TTGGGTCACC AGCAAATCGC GCTGTTAGCG GGCCCATTAA 

AGACCAGCGT AACCCAGTGG TCGTTTAGCG CGACAATCGC CCGGGTAATT 

651 GTTCTGTCTC GGCGCGTCTG CGTCTGGCTG GCTGGCATAA ATATCTCACT 

CAAGACAGAG CCGCGCAGAC GCAGACCGAC CGACCGTATT TATAGAGTGA 

701 CGCAATCAAA TTCAGCCGAT AGCGGAACGG GAAGGCGACT GGAGTGCCAT 

GCGTTAGTTT AAGTCGGCTA TCGCCTTGCC CTTCCGCTGA CCTCACGGTA 

751 GTCCGGTTTT CAACAAACCA TGCAAATGCT GAATGAGGGC ATCGTTCCCA 

CAGGCCAAAA GTTGTTTGGT ACGTTTACGA CTTACTCCCG TAGCAAGGGT 

801 CTGCGATGCT GGTTGCCAAC GATCAGATGG CGCTGGGCGC AATGCGCGCC 

GACGCTACGA CCAACGGTTG CTAGTCTACC GCGACCCGCG TTACGCGCGG

851 ATTACCGAGT CCGGGCTGCG CGTTGGTGCG GACATCTCGG TAGTGGGATA 

TAATGGCTCA GGCCCGACGC GCAACCACGC CTGTAGAGCC ATCACCCTAT 

901 CGACGATACC GAAGACAGCT CATGTTATAT CCCGCCGTTA ACCACCATCA 

GCTGCTATGG CTTCTGTCGA GTACAATATA GGGCGGCAAT TGGTGGTAGT 

951 AACAGGATTT TCGCCTGCTG GGGCAAACCA GCGTGGACCG CTTGCTGCAA 
TTGTCCTAAA AGCGGACGAC CCCGTTTGGT CGCACCTGGC GAACGACGTT 

1001 CTCTCTCAGG GCCAGGCGGT GAAGGGCAAT CAGCTGTTGC CCGTCTCACT
GAGAGAGTCC CGGTCCGCCA CTTCCCGTTA GTCGACAACG GGCAGAGTGA

1051 GGTGAAAAGA AAAACCACCC TGGCGCCCAA TACGCAAACC GCCTCTCCCC 
CCACTTTTCT TTTTGGTGGG ACCGCGGGTT ATGCGTTTGG CGGAGAGGGG 

1101 GCGCGTTGGC CGATTCATTA ATGCAGCTGG CACGACAGGT TTCCCGACTG
CGCGCAACCG GCTAAGTAAT TACGTCGACC GTGCTGTCCA AAGGGCTGAC 

1151 GAAAGCGGGC AGTGAGCGGT ACCCGATAAA AGCGGCTTCC TGACAGGAGG 
CTTTCGCCCG TCACTCGCCA TGGGCTATTT TCGCCGAAGG ACTGTCCTCC 

1201 CCGTTTTGTT TTGCAGCCCA CCTCAACGCA ATTAATGTGA GTTAGCTCAC 
GGCAAAACAA AACGTCGGGT GGAGTTGCGT TAATTACACT CAATCGAGTG 

1251 TCATTAGGCA CCCCAGGCTT TACACTTTAT GCTTCCGGCT CGTATGTTGT 
AGTAATCCGT GGGGTCCGAA ATGTGAAATA CGAAGGCCGA GCATACAACA 

1301 GTGGAATTGT GAGCGGATAA CAATTTCACA CAGGAAACAG CTATGACCAT 
CACCTTAACA CTCGCCTATT GTTAAAGTGT GTCCTTTGTC GATACTGGTA 

XbaI


1351 GATTACGAAT TTCTAGAGGT TGAGGTGATT TTATGAAAAA GAATATCGCA 
CTAATGCTTA AAGATCTCCA ACTCCACTAA AATACTTTTT CTTATAGCGT 

1401 TTTCTTCTTG CATCTATGTT CGTTTTTTCT ATTGCTACAA ATGCATACGC 
AAAGAAGAAC GTAGATACAA GCAAAAAAGA TAACGATGTT TACGTATGCG 

EcoRI 

1451 TGAATTCCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT 
ACTTAAGGTG GGTCTTTGCG ACCACTTTCA TTTTCTACGA CTTCTAGTCA 

1501 TGGGTGCACG AGTGGGTTAC ATCGAACTGG ATCTCAACAG CGGTAAGATC 
ACCCACGTGC TCACCCAATG TAGCTTGACC TAGAGTTGTC GCCATTCTAG 

1551 CTTGAGAGTT TTCGCCCCGA AGAACGTTTT CCAATGATGA GCACTTTTAA 
GAACTCTCAA AAGCGGGGCT TCTTGCAAAA GGTTACTACT CGTGAAAATT 

1601 AGTTCTGCTA TGTGGCGCGG TATTATCCCG TATTGACGCC GGGCAAGAGC 
TCAAGACGAT ACACCGCGCC ATAATAGGGC ATAACTGCGG CCCGTTCTCG


ScaI


1651 AACTCGGTCG CCGCATACAC TATTCTCAGA ATGACTTGGT TGAGTACTCA 

TTGAGCCAGC GGCGTATGTG ATAAGAGTCT TACTGAACCA ACTCATGAGT 

1701 CCAGTCACAG AAAAGCATCT TACGGATGGC ATGACAGTAA GAGAATTATG 

GGTCAGTGTC TTTTCGTAGA ATGCCTACCG TACTGTCATT CTCTTAATAC 

1751 CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA 

GTCACGACGG TATTGGTACT CACTATTGTG ACGCCGGTTG AATGAAGACT 

1801 CAACGATCGG AGGACCGAAG GAGCTAACCG CTTTTTTGCA CAACATGGGG

GTTGCTAGCC TCCTGGCTTC CTCGATTGGC GAAAAAACGT GTTGTACCCC 

1851 GATCATGTAA CTCGCCTTGA TCGTTGGGAA CCGGAGCTGA ATGAAGCCAT

CTAGTACATT GAGCGGAACT AGCAACCCTT GGCCTCGACT TACTTCGGTA 

1901 ACCAAACGAC GAGCGTGACA CCACGATGCC TGTAGCAATG GCAACAACGT 

TGGTTTGCTG CTCGCACTGT GGTGCTACGG ACATCGTTAC CGTTGTTGCA 

1951 TGCGCAAACT ATTAACTGGC GAACTACTTA CTCTAGCTTC CCGGCAACAA 

ACGCGTTTGA TAATTGACCG CTTGATGAAT GAGATCGAAG GGCCGTTGTT 

2001 TTAATAGACT GGATGGAGGC GGATAAAGTT GCAGGACCAC TTCTGCGCTC 

AATTATCTGA CCTACCTCCG CCTATTTCAA CGTCCTGGTG AAGACGCGAG 

2051 GGCCCTTCCG GCTGGCTGGT TTATTGCTGA TAAATCTGGA GCCGGTGAGC 

CCGGGAAGGC CGACCGACCA AATAACGACT ATTTAGACCT CGGCCACTCG 

2101 GTGGGTCTCG CGGTATCATT GCAGCACTGG GGCCAGATGG TAAGCCCTCC 

CACCCAGAGC GCCATAGTAA CGTCGTGACC CCGGTCTAGC ATTCGGGAGG 

2151 CGTATCGTAG TTATCTACAC GACGGGGAGT CAGGCAACTA TGGATGAACG 

GCATAGCATC AATAGATGTG CTGCCCCTCA GTCCGTTGAT ACCTACTTGC

2201 AAATAGACAG ATCGCTGAGA TAGGTGCCTC ACTGATTAAG CATTGGTAAT 

TTTATCTGTC TAGCGACTCT ATCCACGGAG TGACTAATTC GTAACCATTA 

HindIII


2251 GAGCATGCAA GCTTGACCTG TGAAGTGAAA AATGGCGCAC ATTGTGCGAC 

CTCGTACGTT CGAACTGGAC ACTTCACTTT TTACCGCGTG TAACACGCTG 

2301 ATTTTTTTTG TCTGCCGTTT ACCGCTACTG CGTCACGGAT CCCCACGCGC 

TAAAAAAAAC AGACGGCAAA TGGCGATGAC GCAGTGCCTA GGGGTGCGCG 

2351 CCTGTAGCGG CGCATTAAGC GCGGCGGGTG TGGTGGTTAC GCGCAGCGTG 

GGACATCGCC GCGTAATTCG CGCCGCCCAC ACCACCAATG CGCGTCGCAC 

2401 ACCGCTACAC TTGCCAGCGC CCTAGCGCCC GCTCCTTTCG CTTTCTTCCC 

TGGCGATGTG AACGGTCGCG GGATCGCGGG CGAGGAAAGC GAAAGAAGGG 
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Figure 3: pBS vector series: functional map and sequence of 
pBS13 (continued) 

2451 TTCCTTTCTC GCCACGTTCG CCGGCTTTCC CCGTCAAGCT CTAAATCGGG 
AAGGAAAGAG CGGTGCAAGC GGCCGAAAGG GGCAGTTCGA GATTTAGCCC 

2501 GCATCCCTTT AGGGTTCCGA TTTAGTGCTT TACGGCACCT CGACCCCAAA 
CGTAGGGAAA TCCCAAGGCT AAATCACGAA ATGCCGTGGA GCTGGGGTTT 

2551 AAACTTGATT AGGGTGATGG TTCACGTAGT GGGCCATCGC CCTGATAGAC 
TTTGAACTAA TCCCACTACC AAGTGCATCA CCCGGTAGCG GGACTATCTG 

2601 GGTTTTTCGC CCTTTGACGT TGGAGTCCAC GTTCTTTAAT AGTGGACTCT 
CCAAAAAGCG GGAAACTGCA ACCTCAGGTG CAAGAAATTA TCACCTGAGA 

2651 TGTTCCAAAC TGGAACAACA CTCAACCCTA TCTCGGTCTA TTCTTTTGAT 
ACAAGGTTTG ACCTTGTTGT GAGTTGGGAT AGAGCCAGAT AAGAAAACTA 

2701 TTATAAGGGA TTTTGCCGAT TTCGGCCTAT TGGTTAAAAA ATGAGCTGAT 
AATATTCCCT AAAACGGCTA AAGCCGGATA ACCAATTTTT TACTCGACTA 

2751 TTAACAAAAA TTTAACGCGA ATTTTAACAA AATATTAACG TTTACAATTT 
AATTGTTTTT AAATTGCGCT TAAAATTGTT TTATAATTGC AAATGTTAAA 

2801 CAGGTGGCAC TTTTCGGGGA AATGTGCGCG GAACCCCTAT TTGTTTATTT 

GTCCACCGTG AAAAGCCCCT TTACACGCGC CTTGGGGATA AACAAATAAA 

2851 TTCTAAATAC ATTCAAATAT GTATCCGCTC ATGTCGAGAC GTTGGGTGAG 

AAGATTTATG TAAGTTTATA CATAGGCGAG TACAGCTCTG CAACCCACTC 

2901 GTTCCAACTT TCACCATAAT GAAATAAGAT CACTACCGGG CGTATTTTTT 

CAAGGTTGAA AGTGGTATTA CTTTATTCTA GTGATGGCCC GCATAAAAAA 

2951 GAGTTATCGA GATTTTCAGG AGCTAAGGAA GCTAAAATGG AGAAAAAAAT 

CTCAATAGCT CTAAAAGTCC TCGATTCCTT CGATTTTACC TCTTTTTTTA 

3001 CACTGGATAT ACCACCGTTG ATATATCCCA ATGGCATCGT AAAGAACATT 

GTGACCTATA TGGTGGCAAC TATATAGGGT TACCGTAGCA TTTCTTGTAA 

3051 TTGAGGCATT TCAGTCAGTT GCTCAATGTA CCTATAACCA GACCGTTCAG 

AACTCCGTAA AGTCAGTCAA CGAGTTACAT GGATATTGGT CTGGCAAGTC 

3101 CTGGATATTA CGGCCTTTTT AAAGACCGTA AAGAAAAATA AGCACAAGTT 

GACCTATAAT GCCGGAAAAA TTTCTGGCAT TTCTTTTTAT TCGTGTTCAA 

3151 TTATCCGGCC TTTATTCACA TTCTTGCCCG CCTGATGAAT GCTCATCCGG 

AATAGGCCGG AAATAAGTGT AAGAACGGGC GGACTACTTA CGAGTAGGCC 

3201 AGTTCCGTAT GGCAATGAAA GACGGTGAGC TGGTGATATG GGATAGTGTT 

TCAAGGCATA CCGTTACTTT CTGCCACTCG ACCACTATAC CCTATCACAA 

3251 CACCCTTGTT ACACCGTTTT CCATGAGCAA ACTGAAACGT TTTCATCGCT 

GTGGGAACAA TGTGGCAAAA GGTACTCGTT TGACTTTGCA AAAGTAGCGA 
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Figure 3: pBS vector series: functional map and sequence of 
pBS13 (continued) 

3301 CTGGAGTGAA TACCACGACG ATTTCCGGCA GTTTCTACAC ATATATTCGC 
GACCTCACTT ATGGTGCTGC TAAAGGCCGT CAAAGATGTG TATATAAGCG 

3351 AAGATGTGGC GTGTTACGGT GAAAACCTGG CCTATTTCCC TAAAGGGTTT 
TTCTACACCG CACAATGCCA CTTTTGGACC GGATAAAGGG ATTTCCCAAA 

3401 ATTGAGAATA TGTTTTTCGT CTCAGCCAAT CCCTGGGTGA GTTTCACCAG 
TAACTCTTAT ACAAAAAGCA GAGTCGGTTA GGGACCCACT CAAAGTGGTC 

3451 TTTTGATTTA AACGTGGCCA ATATGGACAA CTTCTTCGCC CCCGTTTTCA 
AAAACTAAAT TTGCACCGGT TATACCTGTT GAAGAAGCGG GGGCAAAAGT 

3501 CCATGGGCAA ATATTATACG CAAGGCGACA AGGTGCTGAT GCCGCTGGCG 
GGTACCCGTT TATAATATGC GTTCCGCTGT TCCACGACTA CGGCGACCGC 

3551 ATTCAGGTTC ATCATGCCGT CTGTGATGGC TTCCATGTCG GCAGAATGCT 
TAAGTCCAAG TAGTACGGCA GACACTACCG AAGGTACAGC CGTCTTACGA 

ScaI 

3601 TAATGAATTA CAACAGTACT GCGATGAGTG GCAGGGCGGG GCGTAATTTT 
ATTACTTAAT GTTGTCATGA CGCTACTCAC CGTCCCGCCC CGCATTAAAA 

3651 TTTAAGGCAG TTATTGGTGC CCTTAAACGC CTGGTGCTAC GCCTGAATAA 
AAATTCCGTC AATAACCACG GGAATTTGCG GACCACGATG CGGACTTATT 

3701 GTGATAATAA GCGGATGAAT GGCAGAAATT CGAAAGCAAA TTCGACCCGG 
CACTATTATT CGCCTACTTA CCGTCTTTAA GCTTTCGTTT AAGCTGGGCC 

3751 TCGTCGGTTC AGGGCAGGGT CGTTAAATAG CCGCTTATGT CTATTGCTGG 
AGCAGCCAAG TCCCGTCCCA GCAATTTATC GGCGAATACA GATAACGACC 

3801 TTTACCGGTT TATTGACTAC CGGAAGCAGT GTGACCGTGT GCTTCTCAAA 
AAATGGCCAA ATAACTGATG GCCTTCGTCA CACTGGCACA CGAAGAGTTT 

3851 TGCCTGAGGC CAGTTTGCTC AGGCTCTCCC CGTGGAGGTA ATAATTGCTC 
ACGGACTCCG GTCAAACGAG TCCGAGAGGG GCACCTCCAT TATTAACGAG 

3901 GACATGACCA AAATCCCTTA ACGTGAGTTT TCGTTCCACT GAGCGTCAGA 
CTGTACTGGT TTTAGGGAAT TGCACTCAAA AGCAAGGTGA CTCGCAGTCT 

3951 CCCCGTAGAA AAGATCAAAG GATCTTCTTG AGATCCTTTT TTTCTGCGCG 
GGGGCATCTT TTCTAGTTTC CTAGAAGAAC TCTAGGAAAA AAAGACGCGC 

4001 TAATCTGCTG CTTGCAAACA AAAAAACCAC CGCTACCAGC GGTGGTTTGT 
ATTAGACGAC GAACGTTTGT TTTTTTGGTG GCGATGGTCG CCACCAAACA 

4051 TTGCCGGATC AAGAGCTACC AACTCTTTTT CCGAAGGTAA CTGGCTTCAG 
AACGGCCTAG TTCTCGATGG TTGAGAAAAA GGCTTCCATT GACCGAAGTC 
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Figure 3: pBS vector series: functional map and sequence of 
pBS13 (continued) 


4101 CAGAGCGCAG ATACCAAATA CTGTCCTTCT AGTGTAGCCG TAGTTAGGCC 
GTCTCGCGTC TATGGTTTAT GACAGGAAGA TCACATCGGC ATCAATCCGG 

4151 ACCACTTCAA GAACTCTGTA GCACCGCCTA CATACCTCGC TCTGCTAATC 
TGGTGAAGTT CTTGAGACAT CGTGGCGGAT GTATGGAGCG AGACGATTAG 

4201 CTGTTACCAG TGGCTGCTGC CAGTGGCGAT AAGTCGTGTC TTACCGGGTT 
GACAATGGTC ACCGACGACG GTCACCGCTA TTCAGCACAG AATGGCCCAA 

4251 GGACTCAAGA CGATAGTTAC CGGATAAGGC GCAGCGGTCG GGCTGAACGG 
CCTGAGTTCT GCTATCAATG GCCTATTCCG CGTCGCCAGC CCGACTTGCC 

4301 GGGGTTCGTG CACACAGCCC AGCTTGGAGC GAACGACCTA CACCGAACTG 
CCCCAAGCAC GTGTGTCGGG TCGAACCTCG CTTGCTGGAT GTGGCTTGAC 

4351 AGATACCTAC AGCGTGAGCT ATGAGAAAGC GCCACGCTTC CCGAAGGGAG 
TCTATGGATG TCGCACTCGA TACTCTTTCG CGGTGCGAAG GGCTTCCCTC 

4401 AAAGGCGGAC AGGTATCCGG TAAGCGGCAG GGTCGGAACA GGAGAGCGCA 
TTTCCGCCTG TCCATAGGCC ATTCGCCGTC CCAGCCTTGT CCTCTCGCGT 

4451 CGAGGGAGCT TCCAGGGGGA AACGCCTGGT ATCTTTATAG TCCTGTCGGG 
GCTCCCTCGA AGGTCCCCCT TTGCGGACCA TAGAAATATC AGGACAGCCC 

4501 TTTCGCCACC TCTGACTTGA GCGTCGATTT TTGTGATGCT CGTCAGGGGG 
AAAGCGGTGG AGACTGAACT CGCAGCTAAA AACACTACGA GCAGTCCCCC 

4551 GCGGAGCCTA TGGAAAAACG CCAGCAACGC GGCCTTTTTA CGGTTCCTGG 
CGCCTCGGAT ACCTTTTTGC GGTCGTTGCG CCGGAAAAAT GCCAAGGACC 

4601 CCTTTTGCTG GCCTTTTGCT CACATG 
GGAAAACGAC CGGAAAACGA GTGTAC 
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Figure 4: Co-existence of phagemids: results of restriction digest 


Lanes: R1 R1 R2 R2 M1 1 2 3 4 5 M2 6 7 8 9 10 






WO 97/32017 


PCT/EP97/00931 


Figure 5: Phagemid vector pYING1-C1: functional map 
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Figure 6: Phagemid vector pYANG3-A: functional map 
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Figure 7: Analysis of selected clones (see Table 2) 


7.a: Restriction digest of clones before and after selection 
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Figure 8: Phagemid vector pING1-C1: functional map 
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Figure 9: Phagemid vector pONG3-A: functional map 
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Figure 10: Co-transformation of phage and plasmid, polyphage 
formation and selection via SIP: general description 
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Figure 11: Phage vector fhag1A: functional map 
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Figure 11a: CAT gene module: functional map and sequence 
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Figure 11a: CAT gene module: functional map and sequence 
(cont.) 

AatII 


1 GGGACGTCGG GTGAGGTTCC AACTTTCACC ATAATGAAAT AAGATCACTA 
CCCTGCAGCC CACTCCAAGG TTGAAAGTGG TATTACTTTA TTCTAGTGAT 

51 CCGGGCGTAT TTTTTGAGTT ATCGAGATTT TCAGGAGCTA AGGAAGCTAA 
GGCCCGCATA AAAAACTCAA TAGCTCTAAA AGTCCTCGAT TCCTTCGATT 

101 AATGGAGAAA AAAATCACTG GATATACCAC CGTTGATATA TCCCAATGGC 
TTACCTCTTT TTTTAGTGAC CTATATGGTG GCAACTATAT AGGGTTACCG 

151 ATCGTAAAGA ACATTTTGAG GCATTTCAGT CAGTTGCTCA ATGTACCTAT 
TAGCATTTCT TGTAAAACTC CGTAAAGTCA GTCAACGAGT TACATGGATA 

201 AACCAGACCG TTCAGCTGGA TATTACGGCC TTTTTAAAGA CCGTAAAGAA 
TTGGTCTGGC AAGTCGACCT ATAATGCCGG AAAAATTTCT GGCATTTCTT 

251 AAATAAGCAC AAGTTTTATC CGGCCTTTAT TCACATTCTT GCCCGCCTGA 
TTTATTCGTG TTCAAAATAG GCCGGAAATA AGTGTAAGAA CGGGCGGACT 

301 TGAATGCTCA CCCGGAGTTC CGTATGGCAA TGAAAGACGG TGAGCTGGTG 
ACTTACGAGT GGGCCTCAAG GCATACCGTT ACTTTCTGCC ACTCGACCAC 

351 ATATGGGATA GTGTTCACCC TTGTTACACC GTTTTCCATG AGCAAACTGA 
TATACCCTAT CACAAGTGGG AACAATGTGG CAAAAGGTAC TCGTTTGACT 

401 AACGTTTTCA TCGCTCTGGA GTGAATACCA CGACGATTTC CGGCAGTTTC 
TTGCAAAAGT AGCGAGACCT CACTTATGGT GCTGCTAAAG GCCGTCAAAG 

451 TACACATATA TTCGCAAGAT GTGGCGTGTT ACGGTGAAAA CCTGGCCTAT 
ATGTGTATAT AAGCGTTCTA CACCGCACAA TGCCACTTTT GGACCGGATA 

501 TTCCCTAAAG GGTTTATTGA GAATATGTTT TTCGTCTCAG CCAATCCCTG 
AAGGGATTTC CCAAATAACT CTTATACAAA AAGCAGAGTC GGTTAGGGAC 

551 GGTGAGTTTC ACCAGTTTTG ATTTAAACGT AGCCAATATG GACAACTTCT 
CCACTCAAAG TGGTCAAAAC TAAATTTGCA TCGGTTATAC CTGTTGAAGA 

601 TCGCCCCCGT TTTCACTATG GGCAAATATT ATACGCAAGG CGACAAGGTG 
AGCGGGGGCA AAAGTGATAC CCGTTTATAA TATGCGTTCC GCTGTTCCAC 

651 CTGATGCCGC TGGCGATTCA GGTTCATCAT GCCGTTTGTG ATGGCTTCCA 
GACTACGGCG ACCGCTAAGT CCAAGTAGTA CGGCAAACAC TACCGAAGGT 

701 TGTCGGCAGA ATGCTTAATG AATTACAACA GTACTGCGAT GAGTGGCAGG 
ACAGCCGTCT TACGAATTAC TTAATGTTGT CATGACGCTA CTCACCGTCC 

751 GCGGGGCGTA ATTTTTTTAA GGCAGTTATT GGGTGCCCTT AAACGCCTGG 
CGCCCCGCAT TAAAAAAATT CCGTCAATAA CCCACGGGAA TTTGCGGACC 

BglII 


801 TGCTAGATCT TCC 
ACGATCTAGA AGG 
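
A double-stranded listing of this kind can be proofread mechanically. Each lower line is printed left to right beneath the upper strand, so it should be the base-by-base complement of the upper line, without reversal. The Python sketch below is not part of the original disclosure. The function name `strand_mismatches` is hypothetical and only illustrates how a transcription of the listing might be checked.

```python
# Map each base to its Watson-Crick partner (A<->T, C<->G).
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def strand_mismatches(top: str, bottom: str) -> list[int]:
    """Return 1-based positions where `bottom` is not the complement of `top`.

    Both strands are read left to right as printed in the listing, so the
    comparison is position by position (no reverse complement).
    Spaces between the 10-base blocks are ignored.
    """
    t = top.replace(" ", "").upper()
    b = bottom.replace(" ", "").upper()
    if len(t) != len(b):
        raise ValueError("strands differ in length")
    expected = t.translate(COMPLEMENT)
    return [i + 1 for i, (e, o) in enumerate(zip(expected, b)) if e != o]


# Final row of the CAT gene module listing (positions 801-813).
print(strand_mismatches("TGCTAGATCT TCC", "ACGATCTAGA AGG"))  # -> []
```

An empty list means the two printed lines pair correctly. A non-empty list gives the positions, counted from the start of the row, that need checking against the original sheet.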




Figure 12: Phage vector fjun1A: functional map 
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Figure 13: Phage vector fjun1B: functional map 
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Figure 14: Phage vector fpep3_1B: functional map 
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Figure 15: Phage vector fNGF_1B: functional map 
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Figure 16: Plasmid pUC19/IMPhag: functional map 
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Figure 17: Plasmid pUC18/IMPp75: functional map 
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Figure 18: Plasmid pUC18/IMPIL16: functional map 
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Figure 19: Analysis of selected clones (see Table 3) 
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Figure 20: Co-transformation of phagemids, in vivo 
recombination and selection via His-tag: general description 
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Figure 22: Phage vector fjunhag: functional map 
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Figure 23: Spatial in vivo SIP: general description 
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Figure 24: E. coli display: general description 
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Figure 25: pTERMsc2H10myc3sCAM: functional map and 
sequence 
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Figure 25: pTERMsc2H10myc3sCAM: functional map and 
sequence (continued) 

1 ACCCGACACC ATCGAATGGC GCAAAACCTT TCGCGGTATG GCATGATAGC 
TGGGCTGTGG TAGCTTACCG CGTTTTGGAA AGCGCCATAC CGTACTATCG 

51 GCCCGGAAGA GAGTCAATTC AGGGTGGTGA ATGTGAAACC AGTAACGTTA 
CGGGCCTTCT CTCAGTTAAG TCCCACCACT TACACTTTGG TCATTGCAAT 

101 TACGATGTCG CAGAGTATGC CGGTGTCTCT TATCAGACCG TTTCCCGCGT 
ATGCTACAGC GTCTCATACG GCCACAGAGA ATAGTCTGGC AAAGGGCGCA 

151 GGTGAACCAG GCCAGCCACG TTTCTGCGAA AACGCGGGAA AAAGTGGAAG 
CCACTTGGTC CGGTCGGTGC AAAGACGCTT TTGCGCCCTT TTTCACCTTC 

201 CGGCGATGGC GGAGCTGAAT TACATTCCCA ACCGCGTGGC ACAACAACTG 
GCCGCTACCG CCTCGACTTA ATGTAAGGGT TGGCGCACCG TGTTGTTGAC 

251 GCGGGCAAAC AGTCGTTGCT GATTGGCGTT GCCACCTCCA GTCTGGCCCT 
CGCCCGTTTG TCAGCAACGA CTAACCGCAA CGGTGGAGGT CAGACCGGGA 

301 GCACGCGCCG TCGCAAATTG TCGCGGCGAT TAAATCTCGC GCCGATCAAC 
CGTGCGCGGC AGCGTTTAAC AGCGCCGCTA ATTTAGAGCG CGGCTAGTTG 

351 TGGGTGCCAG CGTGGTGGTG TCGATGGTAG AACGAAGCGG CGTCGAAGCC 
ACCCACGGTC GCACCACCAC AGCTACCATC TTGCTTCGCC GCAGCTTCGG 

401 TGTAAAGCGG CGGTGCACAA TCTTCTCGCG CAACGCGTCA GTGGGCTGAT 
ACATTTCGCC GCCACGTGTT AGAAGAGCGC GTTGCGCAGT CACCCGACTA 

451 CATTAACTAT CCGCTGGATG ACCAGGATGC CATTGCTGTG GAAGCTGCCT 
GTAATTGATA GGCGACCTAC TGGTCCTACG GTAACGACAC CTTCGACGGA 

501 GCACTAATGT TCCGGCGTTA TTTCTTGATG TCTCTGACCA GACACCCATC 
CGTGATTACA AGGCCGCAAT AAAGAACTAC AGAGACTGGT CTGTGGGTAG 

551 AACAGTATTA TTTTCTCCCA TGAAGACGGT ACGCGACTGG GCGTGGAGCA 
TTGTCATAAT AAAAGAGGGT ACTTCTGCCA TGCGCTGACC CGCACCTCGT 

601 TCTGGTCGCA TTGGGTCACC AGCAAATCGC GCTGTTAGCG GGCCCATTAA 
AGACCAGCGT AACCCAGTGG TCGTTTAGCG CGACAATCGC CCGGGTAATT 

651 GTTCTGTCTC GGCGCGTCTG CGTCTGGCTG GCTGGCATAA ATATCTCACT 
CAAGACAGAG CCGCGCAGAC GCAGACCGAC CGACCGTATT TATAGAGTGA 

701 CGCAATCAAA TTCAGCCGAT AGCGGAACGG GAAGGCGACT GGAGTGCCAT 
GCGTTAGTTT AAGTCGGCTA TCGCCTTGCC CTTCCGCTGA CCTCACGGTA 

751 GTCCGGTTTT CAACAAACCA TGCAAATGCT GAATGAGGGC ATCGTTCCCA 
CAGGCCAAAA GTTGTTTGGT ACGTTTACGA CTTACTCCCG TAGCAAGGGT 

801 CTGCGATGCT GGTTGCCAAC GATCAGATGG CGCTGGGCGC AATGCGCGCC 
GACGCTACGA CCAACGGTTG CTAGTCTACC GCGACCCGCG TTACGCGCGG 
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Figure 25: pTERMsc2H10myc3sCAM: functional map and 
sequence (continued) 

851 ATTACCGAGT CCGGGCTGCG CGTTGGTGCG GACATCTCGG TAGTGGGATA 

TAATGGCTCA GGCCCGACGC GCAACCACGC CTGTAGAGCC ATCACCCTAT 

901 CGACGATACC GAAGACAGCT CATGTTATAT CCCGCCGTTA ACCACCATCA 

GCTGCTATGG CTTCTGTCGA GTACAATATA GGGCGGCAAT TGGTGGTAGT 

951 AACAGGATTT TCGCCTGCTG GGGCAAACCA GCGTGGACCG CTTGCTGCAA 

TTGTCCTAAA AGCGGACGAC CCCGTTTGGT CGCACCTGGC GAACGACGTT 

1001 CTCTCTCAGG GCCAGGCGGT GAAGGGCAAT CAGCTGTTGC CCGTCTCACT 
GAGAGAGTCC CGGTCCGCCA CTTCCCGTTA GTCGACAACG GGCAGAGTGA 

1051 GGTGAAAAGA AAAACCACCC TGGCGCCCAA TACGCAAACC GCCTCTCCCC 
CCACTTTTCT TTTTGGTGGG ACCGCGGGTT ATGCGTTTGG CGGAGAGGGG 

1101 GCGCGTTGGC CGATTCATTA ATGCAGCTGG CACGACAGGT TTCCCGACTG 
CGCGCAACCG GCTAAGTAAT TACGTCGACC GTGCTGTCCA AAGGGCTGAC 

1151 GAAAGCGGGC AGTGAGCGGT ACCCGATAAA AGCGGCTTCC TGACAGGAGG 
CTTTCGCCCG TCACTCGCCA TGGGCTATTT TCGCCGAAGG ACTGTCCTCC 

1201 CCGTTTTGTT TTGCAGCCCA CCTCAACGCA ATTAATGTGA GTTAGCTCAC 
GGCAAAACAA AACGTCGGGT GGAGTTGCGT TAATTACACT CAATCGAGTG 

1251 TCATTAGGCA CCCCAGGCTT TACACTTTAT GCTTCCGGCT CGTATGTTGT 
AGTAATCCGT GGGGTCCGAA ATGTGAAATA CGAAGGCCGA GCATACAACA 

1301 GTGGAATTGT GAGCGGATAA CAATTTCACA CAGGAAACAG CTATGACCAT 
CACCTTAACA CTCGCCTATT GTTAAAGTGT GTCCTTTGTC GATACTGGTA 

XbaI 


1351 GATTACGAAT TTCTAGATAA CGAGGGCAAA AAATGAAAAA GACAGCTATC 

CTAATGCTTA AAGATCTATT GCTCCCGTTT TTTACTTTTT CTGTCGATAG 

1401 GCGATTGCAG TGGCACTGGC TGGTTTCGCT ACCGTAGCGC AGGCCGACTA 

CGCTAACGTC ACCGTGACCG ACCAAAGCGA TGGCATCGCG TCCGGCTGAT 


EcoRV 


1451 CAAAGATATC GTGATGACCC AGTCTCCAGC AATCATGTCT ACATCTCTAG 
GTTTCTATAG CACTACTGGG TCAGAGGTCG TTAGTACAGA TGTAGAGATC 

1501 GGGAACGGGT CACCATGACC TGCACTGCCA GTTCAAGTGT AAGTTCCTCT 
CCCTTGCCCA GTGGTACTGG ACGTGACGGT CAAGTTCACA TTCAAGGAGA 

1551 TACTTACACT GGTACCAGCA GAAGCCAGGA TCCTCCCCCA AACTCTGGAT 
ATGAATGTGA CCATGGTCGT CTTCGGTCCT AGGAGGGGGT TTGAGACCTA 

1601 TTATAGCACA TCCAACCTGG CTTCTGGAGT CCCAACTCGC TTCAGTGGCA 
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Figure 25: pTERMsc2H10myc3sCAM: functional map and 
sequence (continued) 

AATATCGTGT AGGTTGGACC GAAGACCTCA GGGTTGAGCG AAGTCACCGT 

1651 GTGGGTCTGG GACCTCTTAC TCTCTCACAA TCAGCACCAT GGCGGCTGAG 
CACCCAGACC CTGGAGAATG AGAGAGTGTT AGTCGTGGTA CCGCCGACTC 

1701 GATGCTGCCA CTTATTACTG CCACCAGTAT CATCGTTTCC CACCCACGTT 
CTACGACGGT GAATAATGAC GGTGGTCATA GTAGCAAAGG GTGGGTGCAA 

1751 CGGAGGGGGG ACCAAGCTGG AAATAAAACG GGCTGGTGGT GGTGGTTCTG 
GCCTCCCCCC TGGTTCGACC TTTATTTTGC CCGACCACCA CCACCAAGAC 

1801 GCGGCGGCGG CTCCGGTGGT GGTGGTTCTG AAGTTAAACT GGTCGAGTCT 
CGCCGCCGCC GAGGCCACCA CCACCAAGAC TTCAATTTGA CCAGCTCAGA 

1851 GGAGGAGGCT TGGTGCAACC TGGAGGATCC ATGAAACTCT CCTGTGTTGC 
CCTCCTCCGA ACCACGTTGG ACCTCCTAGG TACTTTGAGA GGACACAACG 

1901 CTCTGGAATC ACTTTCAGTA ATTACCGGAT GAACTGGGTC CGCCAGTCTC 
GAGACCTTAG TGAAAGTCAT TAATGGCCTA CTTGACCCAG GCGGTCAGAG 

1951 CAGAGAAGGG GCTTGAGTGG GTTGCTGAAA TTAGATTGAA ATCTAATAAT 
GTCTCTTCCC CGAACTCACC CAACGACTTT AATCTAACTT TAGATTATTA 

2001 TATGCAACAC ATTATGCGGA GTCTGTGAAA GGGAGGTTCA CCATCTCAAG 
ATACGTTGTG TAATACGCCT CAGACACTTT CCCTCCAAGT GGTAGAGTTC 

2051 AGATGATTCC AAAAGTAGTG TCTACCTGCA AATGAACAAC TTAAGAGCTG 
TCTACTAAGG TTTTCATCAC AGATGGACGT TTACTTGTTG AATTCTCGAC 

2101 AAGACACTGG CATTTATTAC TGTAGAGGGG TTTCATATAC TATAGACTAC 
TTCTGTGACC GTAAATAATG ACATCTCCCC AAAGTATATG ATATCTGATG 

EcoRI 


2151 TGGGGTCAAG GAACCTCAGT CACAGTCTCC TCAGAATTCG AGCAGAAGCT 

ACCCCAGTTC CTTGGAGTCA GTGTCAGAGG AGTCTTAAGC TCGTCTTCGA 

2201 GATCTCTGAG GAAGACCTGT AGGCATGCTT ATTTGTTTGT GAATATCAAG 

CTAGAGACTC CTTCTGGACA TCCGTACGAA TAAACAAACA CTTATAGTTC 

2251 GCCAATCGTC TGACCTGCCT CAACCTCCTG TCAATGCTGG CGGCGGCTCT 

CGGTTAGCAG ACTGGACGGA GTTGGAGGAC AGTTACGACC GCCGCCGAGA 

2301 GGTGGTGGTT CTGGTGGCGG CTCTGAGGGT GGTGGCTCTG AGGGTGGCGG 

CCACCACCAA GACCACCGCC GAGACTCCCA CCACCGAGAC TCCCACCGCC 

2351 TTCTGAGGGT GGCGGCTCTG AGGGAGGCGG TTCCGGTGGT GGCTCTGGTT 

AAGACTCCCA CCGCCGAGAC TCCCTCCGCC AAGGCCACCA CCGAGACCAA 

2401 CCGGTGATTT TGATTATGAA AAGATGGCAA ACGCTAATAA GGGGGCTATG 

GGCCACTAAA ACTAATACTT TTCTACCGTT TGCGATTATT CCCCCGATAC 
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Figure 25: pTERMsc2H10myc3sCAM: functional map and 
sequence (continued) 

2451 ACCGAAAATG CCGATGAAAA CGCGCTACAG TCTGACGCTA AAGGCAAACT 
TGGCTTTTAC GGCTACTTTT GCGCGATGTC AGACTGCGAT TTCCGTTTGA 

2501 TGATTCTGTC GCTACTGATT ACGGTGCTGC TATCGATGGT TTCATTGGTG 
ACTAAGACAG CGATGACTAA TGCCACGACG ATAGCTACCA AAGTAACCAC 

2551 ACGTTTCCGG CCTTGCTAAT GGTAATGGTG CTACTGGTGA TTTTGCTGGC 
TGCAAAGGCC GGAACGATTA CCATTACCAC GATGACCACT AAAACGACCG 

2601 TCTAATTCCC AAATGGCTCA AGTCGGTGAC GGTGATAATT CACCTTTAAT 
AGATTAAGGG TTTACCGAGT TCAGCCACTG CCACTATTAA GTGGAAATTA 

2651 GAATAATTTC CGTCAATATT TACCTTCCCT CCCTCAATCG GTTGAATGTC 
CTTATTAAAG GCAGTTATAA ATGGAAGGGA GGGAGTTAGC CAACTTACAG 

2701 GCCCTTTTGT CTTTGGCGCT GGTAAACCAT ATGAATTTTC TATTGATTGT 
CGGGAAAACA GAAACCGCGA CCATTTGGTA TACTTAAAAG ATAACTAACA 

2751 GACAAAATAA ACTTATTCCG TGGTGTCTTT GCGTTTCTTT TATATGTTGC 
CTGTTTTATT TGAATAAGGC ACCACAGAAA CGCAAAGAAA ATATACAACG 

2801 CACCTTTATG TATGTATTTT CTACGTTTGC TAACATACTG CGTAATAAGG 
GTGGAAATAC ATACATAAAA GATGCAAACG ATTGTATGAC GCATTATTCC 

HindIII 

2851 AGTCTTGATA AGCTTGACCT GTGAAGTGAA AAATGGCGCA CATTGTGCGA 
TCAGAACTAT TCGAACTGGA CACTTCACTT TTTACCGCGT GTAACACGCT 

2901 CATTTTTTTT GTCTGCCGTT TACCGCTACT GCGTCACGGA TCCCCACGCG 
GTAAAAAAAA CAGACGGCAA ATGGCGATGA CGCAGTGCCT AGGGGTGCGC 

2951 CCCTGTAGCG GCGCATTAAG CGCGGCGGGT GTGGTGGTTA CGCGCAGCGT 
GGGACATCGC CGCGTAATTC GCGCCGCCCA CACCACCAAT GCGCGTCGCA 

3001 GACCGCTACA CTTGCCAGCG CCCTAGCGCC CGCTCCTTTC GCTTTCTTCC 
CTGGCGATGT GAACGGTCGC GGGATCGCGG GCGAGGAAAG CGAAAGAAGG 

3051 CTTCCTTTCT CGCCACGTTC GCCGGCTTTC CCCGTCAAGC TCTAAATCGG 
GAAGGAAAGA GCGGTGCAAG CGGCCGAAAG GGGCAGTTCG AGATTTAGCC 

3101 GGCATCCCTT TAGGGTTCCG ATTTAGTGCT TTACGGCACC TCGACCCCAA 
CCGTAGGGAA ATCCCAAGGC TAAATCACGA AATGCCGTGG AGCTGGGGTT 

3151 AAAACTTGAT TAGGGTGATG GTTCACGTAG TGGGCCATCG CCCTGATAGA 
TTTTGAACTA ATCCCACTAC CAAGTGCATC ACCCGGTAGC GGGACTATCT 

3201 CGGTTTTTCG CCCTTTGACG TTGGAGTCCA CGTTCTTTAA TAGTGGACTC 
GCCAAAAAGC GGGAAACTGC AACCTCAGGT GCAAGAAATT ATCACCTGAG 
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Figure 25: pTERMsc2H10myc3sCAM: functional map and 
sequence (continued) 

3251 TTGTTCCAAA CTGGAACAAC ACTCAACCCT ATCTCGGTCT ATTCTTTTGA 
AACAAGGTTT GACCTTGTTG TGAGTTGGGA TAGAGCCAGA TAAGAAAACT 

3301 TTTATAAGGG ATTTTGCCGA TTTCGGCCTA TTGGTTAAAA AATGAGCTGA 
AAATATTCCC TAAAACGGCT AAAGCCGGAT AACCAATTTT TTACTCGACT 

3351 TTTAACAAAA ATTTAACGCG AATTTTAACA AAATATTAAC GTTTACAATT 
AAATTGTTTT TAAATTGCGC TTAAAATTGT TTTATAATTG CAAATGTTAA 

3401 TCAGGTGGCA CTTTTCGGGG AAATGTGCGC GGAACCCCTA TTTGTTTATT 
AGTCCACCGT GAAAAGCCCC TTTACACGCG CCTTGGGGAT AAACAAATAA 

3451 TTTCTAAATA CATTCAAATA TGTATCCGCT CATGTCGAGA CGTTGGGTGA 
AAAGATTTAT GTAAGTTTAT ACATAGGCGA GTACAGCTCT GCAACCCACT 

3501 GGTTCCAACT TTCACCATAA TGAAATAAGA TCACTACCGG GCGTATTTTT 
CCAAGGTTGA AAGTGGTATT ACTTTATTCT AGTGATGGCC CGCATAAAAA 

3551 TGAGTTATCG AGATTTTCAG GAGCTAAGGA AGCTAAAATG GAGAAAAAAA 
ACTCAATAGC TCTAAAAGTC CTCGATTCCT TCGATTTTAC CTCTTTTTTT 

3601 TCACTGGATA TACCACCGTT GATATATCCC AATGGCATCG TAAAGAACAT 
AGTGACCTAT ATGGTGGCAA CTATATAGGG TTACCGTAGC ATTTCTTGTA 

3651 TTTGAGGCAT TTCAGTCAGT TGCTCAATGT ACCTATAACC AGACCGTTCA 
AAACTCCGTA AAGTCAGTCA ACGAGTTACA TGGATATTGG TCTGGCAAGT 

3701 GCTGGATATT ACGGCCTTTT TAAAGACCGT AAAGAAAAAT AAGCACAAGT 
CGACCTATAA TGCCGGAAAA ATTTCTGGCA TTTCTTTTTA TTCGTGTTCA 

3751 TTTATCCGGC CTTTATTCAC ATTCTTGCCC GCCTGATGAA TGCTCATCCG 
AAATAGGCCG GAAATAAGTG TAAGAACGGG CGGACTACTT ACGAGTAGGC 

3801 GAGTTCCGTA TGGCAATGAA AGACGGTGAG CTGGTGATAT GGGATAGTGT 
CTCAAGGCAT ACCGTTACTT TCTGCCACTC GACCACTATA CCCTATCACA 

3851 TCACCCTTGT TACACCGTTT TCCATGAGCA AACTGAAACG TTTTCATCGC 
AGTGGGAACA ATGTGGCAAA AGGTACTCGT TTGACTTTGC AAAAGTAGCG 

3901 TCTGGAGTGA ATACCACGAC GATTTCCGGC AGTTTCTACA CATATATTCG 
AGACCTCACT TATGGTGCTG CTAAAGGCCG TCAAAGATGT GTATATAAGC 

3951 CAAGATGTGG CGTGTTACGG TGAAAACCTG GCCTATTTCC CTAAAGGGTT 
GTTCTACACC GCACAATGCC ACTTTTGGAC CGGATAAAGG GATTTCCCAA 

4001 TATTGAGAAT ATGTTTTTCG TCTCAGCCAA TCCCTGGGTG AGTTTCACCA 
ATAACTCTTA TACAAAAAGC AGAGTCGGTT AGGGACCCAC TCAAAGTGGT 

4051 GTTTTGATTT AAACGTGGCC AATATGGACA ACTTCTTCGC CCCCGTTTTC 
CAAAACTAAA TTTGCACCGG TTATACCTGT TGAAGAAGCG GGGGCAAAAG 
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Figure 25: pTERMsc2H10myc3sCAM: functional map and 
sequence (continued) 

4101 ACCATGGGCA AATATTATAC GCAAGGCGAC AAGGTGCTGA TGCCGCTGGC 
TGGTACCCGT TTATAATATG CGTTCCGCTG TTCCACGACT ACGGCGACCG 

4151 GATTCAGGTT CATCATGCCG TCTGTGATGG CTTCCATGTC GGCAGAATGC 
CTAAGTCCAA GTAGTACGGC AGACACTACC GAAGGTACAG CCGTCTTACG 

ScaI 


4201 TTAATGAATT ACAACAGTAC TGCGATGAGT GGCAGGGCGG GGCGTAATTT 
AATTACTTAA TGTTGTCATG ACGCTACTCA CCGTCCCGCC CCGCATTAAA 

4251 TTTTAAGGCA GTTATTGGTG CCCTTAAACG CCTGGTGCTA CGCCTGAATA 
AAAATTCCGT CAATAACCAC GGGAATTTGC GGACCACGAT GCGGACTTAT 

4301 AGTGATAATA AGCGGATGAA TGGCAGAAAT TCGAAAGCAA ATTCGACCCG 
TCACTATTAT TCGCCTACTT ACCGTCTTTA AGCTTTCGTT TAAGCTGGGC 

4351 GTCGTCGGTT CAGGGCAGGG TCGTTAAATA GCCGCTTATG TCTATTGCTG 
CAGCAGCCAA GTCCCGTCCC AGCAATTTAT CGGCGAATAC AGATAACGAC 

4401 GTTTACCGGT TTATTGACTA CCGGAAGCAG TGTGACCGTG TGCTTCTCAA 
CAAATGGCCA AATAACTGAT GGCCTTCGTC ACACTGGCAC ACGAAGAGTT 

4451 ATGCCTGAGG CCAGTTTGCT CAGGCTCTCC CCGTGGAGGT AATAATTGCT 
TACGGACTCC GGTCAAACGA GTCCGAGAGG GGCACCTCCA TTATTAACGA 

4501 CGACATGACC AAAATCCCTT AACGTGAGTT TTCGTTCCAC TGAGCGTCAG 
GCTGTACTGG TTTTAGGGAA TTGCACTCAA AAGCAAGGTG ACTCGCAGTC 

4551 ACCCCGTAGA AAAGATCAAA GGATCTTCTT GAGATCCTTT TTTTCTGCGC 
TGGGGCATCT TTTCTAGTTT CCTAGAAGAA CTCTAGGAAA AAAAGACGCG 

4601 GTAATCTGCT GCTTGCAAAC AAAAAAACCA CCGCTACCAG CGGTGGTTTG 
CATTAGACGA CGAACGTTTG TTTTTTTGGT GGCGATGGTC GCCACCAAAC 

4651 TTTGCCGGAT CAAGAGCTAC CAACTCTTTT TCCGAAGGTA ACTGGCTTCA 
AAACGGCCTA GTTCTCGATG GTTGAGAAAA AGGCTTCCAT TGACCGAAGT 

4701 GCAGAGCGCA GATACCAAAT ACTGTCCTTC TAGTGTAGCC GTAGTTAGGC 
CGTCTCGCGT CTATGGTTTA TGACAGGAAG ATCACATCGG CATCAATCCG 

4751 CACCACTTCA AGAACTCTGT AGCACCGCCT ACATACCTCG CTCTGCTAAT 
GTGGTGAAGT TCTTGAGACA TCGTGGCGGA TGTATGGAGC GAGACGATTA 

4801 CCTGTTACCA GTGGCTGCTG CCAGTGGCGA TAAGTCGTGT CTTACCGGGT 
GGACAATGGT CACCGACGAC GGTCACCGCT ATTCAGCACA GAATGGCCCA 

4851 TGGACTCAAG ACGATAGTTA CCGGATAAGG CGCAGCGGTC GGGCTGAACG 
ACCTGAGTTC TGCTATCAAT GGCCTATTCC GCGTCGCCAG CCCGACTTGC 

4901 GGGGGTTCGT GCACACAGCC CAGCTTGGAG CGAACGACCT ACACCGAACT 
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Figure 25: pTERMsc2H10myc3sCAM: functional map and 
sequence (continued) 

CCCCCAAGCA CGTGTGTCGG GTCGAACCTC GCTTGCTGGA TGTGGCTTGA 

4951 GAGATACCTA CAGCGTGAGC TATGAGAAAG CGCCACGCTT CCCGAAGGGA 

CTCTATGGAT GTCGCACTCG ATACTCTTTC GCGGTGCGAA GGGCTTCCCT 

5001 GAAAGGCGGA CAGGTATCCG GTAAGCGGCA GGGTCGGAAC AGGAGAGCGC 

CTTTCCGCCT GTCCATAGGC CATTCGCCGT CCCAGCCTTG TCCTCTCGCG 

5051 ACGAGGGAGC TTCCAGGGGG AAACGCCTGG TATCTTTATA GTCCTGTCGG 

TGCTCCCTCG AAGGTCCCCC TTTGCGGACC ATAGAAATAT CAGGACAGCC 

5101 GTTTCGCCAC CTCTGACTTG AGCGTCGATT TTTGTGATGC TCGTCAGGGG 

CAAAGCGGTG GAGACTGAAC TCGCAGCTAA AAACACTACG AGCAGTCCCC 

5151 GGCGGAGCCT ATGGAAAAAC GCCAGCAACG CGGCCTTTTT ACGGTTCCTG 

CCGCCTCGGA TACCTTTTTG CGGTCGTTGC GCCGGAAAAA TGCCAAGGAC 

5201 GCCTTTTGCT GGCCTTTTGC TCACATG 

CGGAAAACGA CCGGAAAACG AGTGTAC 
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Table 1: Phagemids Constructed for Experiments 2 and 3 
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Table 2: Results of Experiment 2 (see Figure 7) 




pYING1-C2 + pYANG3-ApeX 
pYING1-C1 + pYANG3-A 
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Table 3: Results of Experiment 4 (see Figure 19) 

Table 3a: Identification of phage/plasmid present in individual clones 

Combination                  Clone(s) 
fhag1A + pUC19/IMPhag        #9 
fpep3_1B + pUC18/IMP-p75     #1, #3, #5, #6, #7, #13, #15, #19 
fpep3_1B + pUC19/IMPhag      #14 
unusual DNA                  #2, #4, #8, #10, #11, #12, #16, #17, #18 


Table 3b: Test for infectivity of individual clones 

Clone #    Titer (transducing units/ml) 
1          2 x 10E4 
2          31 
3          1 x 10E5 
4          1 x 10E5 
5          1 x 10E5 
6          2 x 10E3 
7          1 x 10E4 
8          1 x 10E5 
9          1 x 10E6 
10         1 x 10E4 
11         1 x 10E3 
12         1 x 10E4 
13         3 x 10E3 
14         <10 
15         5 x 10E4 
16         1 x 10E4 
17         5 x 10E2 
18         1 x 10E4 
19         1 x 10E5 
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